(57) Abstract

The invention provides a method for predicting the binding affinity of a peptide to a major histocompatibility (MHC) class
II molecule, comprising: 1) ascertaining the characteristics of an MHC molecule binding groove; 2) presenting a selected peptide to the
MHC molecule and ascertaining a first conformation score for each pocket-bound peptide side-chain; 3) amending the conformation of each
pocket-bound peptide side-chain and ascertaining a second conformation score; 4) repeating step 3 with alternative conformations of each
pocket-bound peptide side-chain; 5) choosing the highest conformation score for each pocket-bound peptide side-chain in each binding
groove pocket, herein known as "the pocket"; and 6) combining the highest conformation scores for each pocket and ascertaining a binding
score for the complete peptide.
IDENTIFICATION OF MHC BINDING PEPTIDES

The present invention relates to a new method for the
prediction of peptides which bind to major histocompatibility
(MHC) class II molecules and to molecules created or modified
through the use of this method.

The immune system of the mammalian organism principally
comprises two arms, the cellular immune system and the humoral
or antibody-associated immune system. The cellular immune
system is centred around the activity of T cells. There are
two major classes of T cells: cytotoxic T lymphocytes (CTLs),
which attack cells displaying foreign antigen complexed with
MHC class I molecules, and helper T cells, which react to cells
displaying foreign antigens in a complex with MHC class II
molecules, resulting in the secretion of cytokines which can
activate B cells to produce antibody molecules.

Humans express six different MHC class I genes and six
different MHC class II genes, which are located on three
highly polymorphic loci. This leads to considerable allelic
variation in MHC molecules. MHC class I molecules consist of an
α-chain and β2-microglobulin; the α-chain is divided into three
domains, α1, α2 and α3. The α1 and α2 domains form the MHC class I
binding groove, which contains pockets that bind the side chains
and the amino and carboxy termini of any peptide present in the
groove. MHC class II molecules comprise an α-chain and
a β-chain; it is the α1 and β1 domains which create the MHC
class II binding groove. The MHC class II binding groove also
contains pockets, but it does not bind the termini of the
peptide. For this reason the peptides bound by an MHC class
II molecule can be longer and of more variable length. The
typical lengths of peptides complexed with an MHC class I or an
MHC class II molecule are 8-10 amino acids and 13-20 amino
acids, respectively.

At present only three MHC class II structures are available, but
it is believed that the backbone structures of all MHC class
II alleles presently identified are similar to that of HLA-
DR1. Structures of different alleles can be predicted by
using homology modelling. This involves identifying the amino
acid differences near the binding groove and using a computer
to change the conformation of the side-chains to give
favourable steric and electrostatic arrangements and to make
the pockets as large as possible. The end result is a three-
dimensional structure of an MHC class II molecule, which can
be used in various experiments.

The ability to predict the peptides in a protein which can
bind to a given MHC molecule has great value, especially for
medical applications. It is known, for example, that in
certain auto-immune diseases, T cells react with self-peptides
presented by MHC class II molecules. It would be valuable to
predict which peptides from auto-immune proteins are presented
by MHC class II molecules in these diseases, as well as to
predict the binding of analogues of these peptides synthesised
as potential antagonists for the presentation of self-
peptides. In the selection of peptides for synthetic
vaccines, the ability to predict MHC class II binding peptides
would be advantageous. In addition, where heterologous
proteins are developed as medicines or diagnostic imaging
agents, it would be advantageous to predict potential MHC
class II binding peptides in order to eliminate these from the
heterologous proteins before administration to patients.

While studies of peptides complexed with MHC class I molecules 
have revealed conserved "anchor" residues at certain positions 
within the presented peptides, such studies with peptides 
complexed with MHC class II molecules have been less 
successful mainly because of the greater length variability 
of such peptides and the consequent difficulty in aligning 
their sequences.

Methods for accurately predicting the binding potential of
peptides have been restricted to MHC class I interaction with
a peptide. In one method using three-dimensional structures
of MHC class I molecules, peptide binding is ranked in
ascending order according to the energy values determined.
This method requires that the MHC structure be known, or that
there is an obvious molecular model for the MHC structure.
An identical method is said to be available for MHC class II,
but it does not consider the longer average length of the
peptide and the open-ended peptide binding groove of MHC class
II molecules. Neither does it use the best potential
conformation of peptide amino acid side-chains and, therefore,
the binding energies calculated are only approximations.

Another drawback of using the same method for MHC class I and
MHC class II peptide binding is that the binding of peptides
to MHC class II is less dependent on strict allele-specific
binding motifs than that of peptides binding to MHC class I.
Individual amino acids in the peptide play a more significant
role in MHC class II binding than in MHC class I binding, such
that the conformation of amino acid side-chains is proportionally
more important to the accuracy of binding analysis. Therefore,
known methods do not provide a general method for analysing
the binding of peptides to three-dimensional structures of MHC
class II. There is thus a need for improved methods for
predicting the MHC class II binding potential of peptides.

An object of this invention is to provide a method for 
accurately predicting the binding affinity of a peptide 
fragment binding to a MHC class II molecule. 






Another object of this invention is to provide a computer 
conditioned to perform the task of predicting the binding 
affinity of a peptide fragment binding to a MHC class II 
molecule. 

A yet further object of this invention is to provide a vaccine
derived from the peptide fragment whose binding affinity to
MHC class II molecules has been determined.

A still further object of this invention is to provide a
pharmaceutical composition which comprises a peptide whose
binding affinity to MHC class II molecules has been
determined.

According to the first aspect of this invention, there is
provided a method for the prediction of the binding affinity
of a peptide to a major histocompatibility (MHC) class II
molecule, comprising:

1) ascertaining the characteristics of an MHC molecule binding
groove,

2) presenting a selected peptide to the MHC molecule and
ascertaining a first conformation score for each pocket-bound
peptide side-chain,

3) amending the conformation of each pocket-bound peptide
side-chain and ascertaining a second conformation score,

4) repeating step 3 with alternative conformations of each
pocket-bound peptide side-chain,

5) choosing the highest conformation score for each pocket-
bound peptide side-chain,

6) combining the highest conformation score for each pocket-
bound peptide side-chain and then ascertaining a binding score
for the peptide.

It is particularly desirable to then compile information on
all peptide fragments in a protein and compare the binding
scores. It is preferable if the conformation of the backbone
of the peptide fragment is also altered and the conformation
score and the binding score are then reassessed.

The method of this invention thus involves assessing a binding
score for all possible candidate peptides by considering the
predicted three-dimensional conformations and interactions
between the MHC and the peptide in the complex. The computed
score indicates the predicted binding affinity for the
particular peptide binding with the MHC allele and can be used
to predict whether or not the peptides are likely to bind.

Preferably, the conformation score for each pocket-bound
peptide side-chain is ascertained by considering at least one
of the following parameters:

a) the steric overlap between the pocket-bound peptide residue
and an atom forming the pocket; this is value B,

b) the number of hydrogen bonds which can be formed between
the pocket-bound peptide residue and an atom forming the
pocket; this is value C,

c) the strength of electrostatic interactions between any
polar atoms of the pocket-bound peptide residue and any polar
atoms forming the pocket; this is value D, and

d) the number of favourable contacts between the pocket-bound
peptide residue and the MHC residues forming one of the
pockets; this is value E.



The conformation score for each peptide is computed based upon
the predicted atomic interactions between each of the pocket-
bound peptide residues and the MHC pockets. The geometric
constraints imposed on the peptide by the shape of the MHC
binding groove play an important part in the scoring function.
Favourable packing arrangements between peptide and MHC side-
chains are rewarded by the scoring function, whilst
arrangements involving steric overlap are penalised.
Alternative conformations are tried for MHC residues if an MHC
residue overlaps with a peptide side chain.

If no preferable conformation can be found, the MHC side-chain
is returned to its original conformation. In the event of
more than one pocket residue side-chain overlapping with a
pocket-bound peptide side chain, the pocket residue side
chains are adjusted in order of overlap severity, with the
pocket residue side-chain which has the most severe overlap
being adjusted first.



In preferred embodiments the steric overlap between the pocket-
bound peptide residue and the atoms forming the pocket cannot
be greater than 0.35 Angstroms, otherwise the residue is
deemed unable to fit in the pocket.

Conveniently a favourable contact occurs when an atom from an
MHC residue and an atom from the peptide residue have their
centres separated by no more than the sum of their radii plus
0.5 Angstroms and are not overlapping.
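
By way of illustration only, the two geometric criteria just stated can be sketched in Python. The function names are hypothetical, and measuring overlap as the sum of the two atomic radii minus the centre-to-centre distance, applied to each atom pair separately, is an assumption; the specification does not state how overlap is measured.

```python
import math

MAX_OVERLAP = 0.35     # Angstroms; above this the residue is deemed not to fit
CONTACT_MARGIN = 0.5   # Angstroms added to the sum of radii for a contact


def overlap(center_a, radius_a, center_b, radius_b):
    """Assumed overlap measure: sum of radii minus centre separation, floored at 0."""
    separation = math.dist(center_a, center_b)
    return max(0.0, radius_a + radius_b - separation)


def is_favourable_contact(center_a, radius_a, center_b, radius_b):
    """True if the centres lie within (sum of radii + 0.5 A) and the atoms do not overlap."""
    separation = math.dist(center_a, center_b)
    radii_sum = radius_a + radius_b
    return radii_sum <= separation <= radii_sum + CONTACT_MARGIN


def fits_pocket(pair_overlaps):
    """True if no atom-pair overlap exceeds the 0.35 A threshold (per-pair test assumed)."""
    return all(value <= MAX_OVERLAP for value in pair_overlaps)
```

Counting the atom pairs for which `is_favourable_contact` holds would give a value playing the role of E, under the assumptions stated.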


Preferably the values B to E are imported into a first
equation to give a conformation score (Z). The first equation
is

$Z_n = (cK_2 C) - (cK_3 D) + (cK_4 E) - (cK_1 B)$,

where $cK_1$ to $cK_4$ are constants and n is the number of the pocket.

The value of $cK_1$ is between 50 and 150, preferably between 75
and 125.

The value of $cK_2$ is between 1000 and 2000, preferably between
1250 and 1750.

The value of $cK_3$ is between 250 and 750, preferably between
350 and 650.

The value of $cK_4$ is between 500 and 1500, preferably between
750 and 1250.
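
As a minimal sketch, the first equation can be written as a small Python function. The names `ConformationWeights` and `conformation_score` are hypothetical; the default constants are the values reported in Example 1 (100, 1500, 500 and 1000), which fall within the preferred ranges given above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConformationWeights:
    ck1: float = 100.0    # weight on steric overlap B (stated range 50 to 150)
    ck2: float = 1500.0   # weight on hydrogen-bond count C (1000 to 2000)
    ck3: float = 500.0    # weight on electrostatic term D (250 to 750)
    ck4: float = 1000.0   # weight on favourable-contact count E (500 to 1500)


def conformation_score(b, c, d, e, weights=ConformationWeights()):
    """Z_n = (cK2*C) - (cK3*D) + (cK4*E) - (cK1*B) for one pocket."""
    return (weights.ck2 * c) - (weights.ck3 * d) + (weights.ck4 * e) - (weights.ck1 * b)
```

For instance, a pocket with no overlap, one hydrogen bond, no electrostatic term and 27 favourable contacts would score 1500 + 27000 = 28500 with these constants.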

Conveniently the $Z_n$ value for a pocket is multiplied by a
coefficient, L, depending on the pocket's importance in
binding, to give a second $Z_n$ value. The value L is in the
range of 0.001 to 5. Larger pockets are considered more
important in determining which peptide can bind, compared with
the other smaller pockets, so the scores contributed by each
pocket are weighted in proportion to the amount of the peptide
side-chain buried by the surface of the MHC molecule. When
binding to MHC class II molecules, peptides have shown high
similarity in the degree to which their side-chains are buried
by the MHC surface, despite having dissimilar sequences.

Preferably all the $Z_n$ values are summed to give a value J.
Value J is the overall contributing score of all the pockets
for a certain conformation of the peptide fragment.
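
The weighting and summation described above might be sketched as follows. The function name and the dictionary-based interface are assumptions for illustration; pockets without an explicit coefficient are given L = 1, which is also an assumption.

```python
def overall_pocket_score(z_by_pocket, coefficient_by_pocket):
    """Return J, the sum over pockets of L * Z_n.

    z_by_pocket maps pocket number -> best conformation score Z_n.
    coefficient_by_pocket maps pocket number -> coefficient L (0.001 to 5).
    """
    total = 0.0
    for pocket, z_value in z_by_pocket.items():
        coefficient = coefficient_by_pocket.get(pocket, 1.0)  # default of 1 is assumed
        if not 0.001 <= coefficient <= 5:
            raise ValueError(f"coefficient L for pocket {pocket} is outside 0.001 to 5")
        total += coefficient * z_value
    return total
```

Passing `{1: 2.0, 5: 2.0}` as the coefficients reproduces the doubling of pockets one and five described in Example 1.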

Conveniently the MHC residue is paired with the pocket-bound
peptide residue if an atom from the MHC residue and an atom
from the pocket-bound peptide residue have their centres
separated by no more than the sum of their van der Waals radii
plus one Angstrom.

In a preferred embodiment a value $A_n$ is calculated by summing
the pairwise interaction frequencies of paired residues. As
for the $Z_n$ value, preferably the value $A_n$ for a pocket is
multiplied by a coefficient, X, depending on the pocket's
importance in binding. Preferably X is between 0.001 and 5.

Conveniently the $A_n$ values for the pockets are summed to give
a value P.



In a preferred embodiment the binding score is ascertained by
at least one of the following parameters:

a) the number of groove-bound hydrophobic residues; this is
value F,

b) the number of non groove-bound hydrophilic residues; this
is value G,

c) the number of peptide residues deemed to fit within their
respective binding pocket; this is value H.

Preferably values F, G, H, J and P are imported into a second
equation to give a first binding score, Y.

Conveniently the second equation is $Y = J \cdot F^2 \cdot (G \cdot H + 1) + P$.
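
For illustration, the second equation can be expressed directly in Python. The function name is hypothetical, and reading the printed "F 2" as F squared is an assumption about the original typography.

```python
def binding_score(j, f, g, h, p):
    """Y = J * F^2 * (G*H + 1) + P.

    j: summed, weighted pocket conformation score
    f: number of groove-bound hydrophobic residues
    g: number of non groove-bound hydrophilic residues
    h: number of peptide residues deemed to fit their pocket
    p: summed, weighted pairwise interaction term
    """
    return j * f**2 * (g * h + 1) + p
```

Because J is multiplied by F squared and by (G*H + 1), a peptide with no groove-bound hydrophobic residues scores only P under this reading.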


However, in the alternative, the term He, which evaluates the
hydrophobicity of the pocket-bound peptide side chains using
a hydrophobicity scale disclosed in Janin et al [1979] Nature,
277, p. 491, can also be used to determine the Y value.
Accordingly,

$Y = (bK_2 C) - (bK_3 D) + (bK_4 E) - (bK_1 B) + (bK_5 He) + P$.

The scale used in Janin et al to measure hydrophobicity has a
range from -1.8 for lysine to 0.9 for cysteine.

It is known that peptides having favourable
hydrophobic/hydrophobic interactions with solvent and MHC
atoms have a higher binding affinity. Accordingly, it is
preferable to include the term He.

The value of $bK_1$ is between 1 and 10, preferably between 1 and
5.

The value of $bK_2$ is between 20 and 60, preferably between 30
and 50.

The value of $bK_3$ is between 300 and 900, preferably between
450 and 750.

The value of $bK_4$ is between 1 and 20, preferably between 5
and 15.

The value of $bK_5$ is between 1 and 800, conveniently
between 100 and 600, and preferably between 100 and 400.
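
The alternative score using the hydrophobicity term He might be sketched as below. The function and dictionary names are hypothetical. No specific values of the bK constants are given in the text, so they must be supplied by the caller and are only checked against the broad ranges stated above.

```python
BK_RANGES = {
    "bk1": (1, 10),     # weight on steric overlap B
    "bk2": (20, 60),    # weight on hydrogen-bond count C
    "bk3": (300, 900),  # weight on electrostatic term D
    "bk4": (1, 20),     # weight on favourable-contact count E
    "bk5": (1, 800),    # weight on hydrophobicity term He
}


def alternative_binding_score(b, c, d, e, he, p, *, bk1, bk2, bk3, bk4, bk5):
    """Y = (bK2*C) - (bK3*D) + (bK4*E) - (bK1*B) + (bK5*He) + P."""
    constants = {"bk1": bk1, "bk2": bk2, "bk3": bk3, "bk4": bk4, "bk5": bk5}
    for name, value in constants.items():
        low, high = BK_RANGES[name]
        if not low <= value <= high:
            raise ValueError(f"{name}={value} lies outside the stated range {low} to {high}")
    return (bk2 * c) - (bk3 * d) + (bk4 * e) - (bk1 * b) + (bk5 * he) + p
```

With purely illustrative constants bk1=3, bk2=40, bk3=600, bk4=10 and bk5=250, inputs B=1.0, C=2, D=0.0, E=30, He=0.5 and P=100 give 80 + 300 - 3 + 125 + 100 = 602.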

In a preferred embodiment determination of the conformation
score and the binding score is repeated for each pocket and
each conformation of the peptide residue in said pocket. The
conformation of the peptide is altered by rotating a side
chain of the peptide residue by a pre-determined amount. In
this way all possible conformations of the peptide side-chain
in the pocket can be studied and the best or most likely
conformation can be chosen to obtain the binding score.
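
A minimal sketch of this systematic scan is given below, assuming that each rotatable bond is stepped through a full turn in fixed increments and that a caller-supplied function returns the conformation score for a given set of torsion angles. The function name, the `score_fn` interface and the -180 to 180 degree convention are hypothetical; the 30 degree default is the increment reported in Example 1.

```python
import itertools


def best_side_chain_conformation(n_torsions, score_fn, step_deg=30):
    """Try every combination of torsion angles and keep the highest-scoring one.

    n_torsions: number of rotatable bonds in the side chain
    score_fn:   callable taking a tuple of angles (degrees) and returning a score
    step_deg:   rotation increment in degrees
    """
    angles = range(-180, 180, step_deg)
    best_angles, best_score = None, float("-inf")
    for combination in itertools.product(angles, repeat=n_torsions):
        score = score_fn(combination)
        if score > best_score:
            best_angles, best_score = combination, score
    return best_angles, best_score
```

The number of conformations grows as (360 / step) raised to the number of torsions, so a four-torsion side chain at 30 degree steps requires 20736 evaluations per pocket.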



The conformation of the backbone of the peptide fragment is
changed by modelling the conformation of the backbone on any
one of 167 backbones which have been previously generated,
based on human and murine crystallographic structures of MHC
class II peptide complexes. The backbone conformation and the
conformation of the peptide fragment side chains are altered
systematically until the conformation score and the binding
score of every possible conformation have been determined.

Conveniently the steps are repeated using different peptides
from a protein.

In preferred embodiments the binding scores (Y) for different
peptides are tabulated and compared. Peptides with the
highest scores are predicted to have the highest binding
affinity for the particular MHC allele.
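
The tabulation and comparison of overlapping peptides can be sketched as follows. The function names, the 1-based sequence positions and the `score_fn` callable standing in for the full binding-score calculation are all assumptions made for illustration.

```python
def overlapping_peptides(protein, length=13):
    """Return (1-based start position, peptide) for every window of the given length."""
    return [
        (start + 1, protein[start:start + length])
        for start in range(len(protein) - length + 1)
    ]


def rank_peptides(protein, score_fn, length=13, top=15):
    """Score every overlapping peptide and return the top entries, best first."""
    scored = [
        (score_fn(peptide), position, peptide)
        for position, peptide in overlapping_peptides(protein, length)
    ]
    # Python's sort is stable, so tied peptides keep their sequence order.
    scored.sort(key=lambda entry: entry[0], reverse=True)
    return scored[:top]
```

A sequence of 566 residues yields 566 - 13 + 1 = 554 overlapping 13-mers, which is the number of Influenza haemagglutinin peptides quoted elsewhere in this specification.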

In a preferred embodiment the method of determining the
binding affinity of a peptide residue for an MHC class II
molecule is used in the manufacture of a vaccine derived from
a peptide identified by said method.

Preferably the method of determining the binding affinity of
a peptide residue for an MHC class II molecule is used to
remove potentially immunogenic sequences from a protein and
thus reduce said protein's immunogenicity when administered to
an organism.

Using the afore-detailed method it is possible to predict the
peptides from an auto-immune protein which are presented by
MHC class II molecules. Thereafter, it is possible to
synthesise peptides which would be antagonists to the
presentation of such peptides by the MHC class II molecules.
It is also possible to determine any proteins in a vaccine
containing heterologous proteins which might result in the
stimulation of T cells due to their presentation on MHC class
II molecules. These proteins could then be altered or removed
depending on their function in the vaccine.
According to a second aspect of the invention there is
provided a computer conditioned to receive information
characterising a peptide bound to the MHC molecule and to
utilise said information to perform a procedure having the
following steps:

1) ascertaining the characteristics of an MHC molecule binding
groove;

2) presenting a selected peptide, which is selected by a
predetermined program, to the MHC molecule and ascertaining
a first conformation score;

3) amending the conformation of the peptide, by way of a
predetermined program, and ascertaining a second conformation
score;

4) repeating step 3 with other conformations of the peptide;

5) selecting the peptide conformation with the highest
conformation score; and

6) calculating the binding score from the conformation score.



Preferably the above detailed procedure also includes a step
(7) which comprises repeating steps 1-4 with other peptide
fragments in the protein to generate information on all
peptide fragments in a protein so that a comparison can be
made of the strength of the binding between the peptide and
the MHC molecule.

Conveniently the above detailed procedure further comprises
a step (8) which comprises altering the conformation of the
backbone of the peptide fragment.

The use of a computer in such a task is important because
there are hundreds of calculations to perform per peptide
fragment. A computer conditioned to perform the task can
systematically change the conformation of the side chains and
the backbone of the peptide fragment while calculating the
conformation score and the binding score.



According to a third aspect of the invention there is provided
a pharmaceutical composition made by determining the binding
affinity of a peptide for an MHC class II molecule.

A pharmaceutical composition is thus engineered to contain a
peptide which is presented by an MHC class II molecule and
which therefore stimulates the body's cellular immune system.
Alternatively the pharmaceutical composition is engineered so
that it does not include peptides which significantly
stimulate the immune system.

The invention will now be described, by way of illustration
only, with reference to the following examples, tables and
figures accompanying the specification.

Figure 1 shows a graphical representation of the binding score
distribution of all 554 13-mer Influenza haemagglutinin
peptides bound to HLA-DRB1*0101.

Figure 2 shows a graphical representation of the binding score
distribution of all 554 13-mer Influenza haemagglutinin
peptides bound to HLA-DRB1*0401.

Table 1 shows the values of all the factors required to
determine the binding score for the 15 peptides from Influenza
haemagglutinin which have the highest binding affinity for
HLA-DRB1*0101.

Table 2 lists the sequence differences between HLA-DRB1*0101
and HLA-DRB1*0401.

Table 3 shows the torsion angles of the mutated side chains
in HLA-DRB1*0401.

Table 4 shows the values of all the factors required to
determine the binding score for the 15 peptides from Influenza
haemagglutinin which have the highest binding affinity for
HLA-DRB1*0401.
Example 1

The following method was used to confirm that the peptide
PKYVKQNTLKLAT has a high binding affinity for the MHC
molecule HLA-DRB1*0101.

The conformation score was calculated as follows for an
oligomeric peptide having thirteen amino acid residues, herein
known as a 13-mer peptide:

a) Calculate the steric overlap between the pocket-bound
peptide residue in the binding groove and an atom forming the
pocket; this is value B.

b) Count the number of hydrogen bonds which could be formed
between the pocket-bound peptide residue and atoms forming the
pocket; this is value C.

c) Calculate the strength of electrostatic interactions
between any polar atoms of the pocket-bound peptide residue
and any polar atoms forming the pocket; this is value D.

d) Count the number of favourable contacts between the pocket-
bound peptide residue and atoms forming the pocket; this is
value E.

These values were then transformed into a conformation score
(Z) by using the following equation:

$Z_n = (cK_2 C) - (cK_3 D) + (cK_4 E) - (cK_1 B)$

where $cK_1$ to $cK_4$ are constants and n is the number of the
pocket. $cK_1$, $cK_2$, $cK_3$ and $cK_4$ are equal to 100, 1500, 500 and
1000 respectively.

The conformation of each rotatable side chain of the pocket-
bound peptide residue was then altered by 30° and the
conformation score was recalculated.

The above steps were repeated for each of the pockets and the
highest conformation score for each of the pockets was used
to determine the binding score.

The binding score was determined by establishing values for
the following parameters:

a) the number of groove-bound hydrophobic residues; this is
value F.

b) the number of non groove-bound hydrophilic residues; this
is value G.

c) the number of peptide residues deemed to fit within their
respective binding pocket; this is value H.

The conformation scores for pockets one and five were
doubled and then all the conformation scores were summed to
give a value J.

The above values were then imported into the following
equation in order to determine the binding score:

$Y = J \cdot F^2 \cdot (G \cdot H + 1) + P$

The binding scores for all the 13-mer peptides from Influenza
haemagglutinin binding with MHC molecule HLA-DRB1*0101 were
calculated and the resultant top 15 binding scores are
presented in Table 1. PKYVKQNTLKLAT has the 8th highest
binding affinity for HLA-DRB1*0101 from all 554 possible
overlapping 13-mer peptides.
Table 1

Rank  Seq.  Peptide        Binding Score  P      B     C  D      E   F  G  H
1     328   NTLKLATGMRNVP  9382500        15012  0.00  1         27  4  6  5
2     453   IDLTDSEMNKLFE  8288922        17964  0.72  1         40  3  6  5
3     373   NSEGTGQAADLKS  7520420        10661  0.68  0  +0.01  30  4  7
4     504   HDVYRDEALNNRF  7211042        15527  0.56  1  -0.05  31  3  6  5
5     119   PDYASLRSLVASS  7174962        17351  0.68  1         40  4  4  5
6     461   NKLFEKTRRQLRE  7049469        19407  0.79  0  +0.01  56  2  7  5
7     122   ASLRSLVASSGTL  6922064        16346  0.09  0         25  4  4  5
8     322   PKYVKQNTLKLAT  6765975        18217  1.82  1         56  3  5  5
9     458   SEMNKLFEKTRRQ  6156822        16617  0.30  4  +0.08  44  2  7  5
10    513   NNRFQIKGVELKS  6096900        14052  1.32  3  -0.01  30  4  7  4
11    439   YNAELLVALENQH  5890199        14198  0.60  1         33  4  4  5
12    63    STGKICNNPHRIL  5887908        12776  0.75  5  -0.05  31  3  6  5
13    50    IEVTNATELVQSS  5503551        14297  0.95  2  +0.06  39  3  5  5
14    262   NSNGNLIAPRGYF  5284475        10102  0.09  1         21  4  5  5
15    257   DVLVINSNGNLIA  5239292        17028  1.35  2         35  3  4  5


Example 2

A method as described in Example 1 was used to confirm that
the peptide PDYASLRSLVASS from Influenza haemagglutinin has
high affinity binding for the MHC molecule HLA-DRB1*0401.

The structure of HLA-DRB1*0401 is not known, but a three-
dimensional model was constructed based on the known structure
of HLA-DRB1*0101 by homology modelling. Ten amino acid
differences between the two molecules were identified (see
Table 2) and HLA-DRB1*0101 was mutated using the molecular
modelling package 'Quanta' to produce a model of HLA-
DRB1*0401.
Then the side-chain conformations of the 10 amino acids were
adjusted interactively. In most cases, torsion angles were
chosen which resulted in little or no steric overlap between
the mutated residues and surrounding atoms. In the case of
non-conserved residues which were either charged or whose
side-chains were able to form hydrogen bonds, the potential
to form favourable interactions was also considered. The
placement of 13H, 28D and 71K was such that these residues
were able to form a favourable electrostatic arrangement
whilst, at the same time, having minimum steric overlap with
surrounding atoms. In the case of 30Y, this residue was
positioned such that its hydroxyl group was situated close to
the side-chain of 9E, where a hydrogen bond may be formed.
The torsion angles chosen for the 10 mutated amino acid
residues were calculated in accordance with the standard
conventions and are listed in Table 3.

The binding scores for all 13-mer peptides from Influenza
haemagglutinin binding with MHC molecule HLA-DRB1*0401 were
calculated and the resultant top 15 binding scores are
presented in Table 4. PDYASLRSLVASS has the 9th highest
binding affinity for HLA-DRB1*0401 from all 554 possible
overlapping 13-mer peptides.

Table 2

Seq. Pos.  HLA-DRB1*0101  HLA-DRB1*0401
b9         Tryptophan     Glutamic acid
b11        Leucine        Valine
b13        Phenylalanine  Histidine
b26        Leucine        Phenylalanine
b28        Glutamic acid  Aspartic acid
b30        Cysteine       Tyrosine
b31        Isoleucine     Phenylalanine
b33        Asparagine     Histidine
b37        Serine         Tyrosine
b71        Arginine       Lysine

Table 3

Residue  χ1     χ2    χ3    χ4
b9       -61°   -71°  -2°
b11      168°
b13      -38°   -63°
b26      170°   57°
b28      -174°  -15°
b30      -174°  41°
b31      -119°  -13°
b33      -95°   -2°
b37      -116°  -2°
b71      -97°   -45°  172°  9°

Table 4

Rank  Seq.  Peptide        Binding Score  P      B     C  D      E   F  G  H
1     453   IDLTDSEMNKLFE  3070823        6559   0.36  0         42  3  6  5
2     373   NSEGTGQAADLKS  2988447        4182   0.36  0  +0.01  32  4  7  5
3     328   NTLKLATGMRNVP  2899375        4639   0.00  1         27  4  6  5
4     122   ASLRSLVASSGTL  2894599        6819   0.03  0         24  4  4  5
5     72    HRILDGIDCTLID  2820446        4623   0.60  1  +0.16  28  4  6  5
6     461   NKLFEKTRRQLRE  2662369        7203   0.36  0  -0.11  50  2  7  5
7     119   PDYASLRSLVASS  2616648        6184   0.11  1         32  4  4  5
8     188   DNFDKLYIWGIHH  2615259        5429   0.58  0         29  5  6  4
9     322   PKYVKQNTLKLAT  2515861        6407   0.46  2         44  3  5  5
10    232   NIGSRPWVRGLSS  2488137        4818   0.41  0  -0.02  35  4  5  5
11    504   HDVYRDEALNNRF  2353661        4965   0.05  1  -0.07  25  3  6  5
12    135   EFITEGFTWTGVT  2208179        3543   0.07  1         20  4  5  5
13    251   TIVKPGDVLVINS  2176819        5259   0.10  0         16  5  5  4
14    257   DVLVINSNGNLIA  2107570        6673   0.71  2         40  3  4  5
15    439   YNAELLVALENQH  2035430        4795   0.03  1         26  4  4  5

Example 3

A library of backbones was constructed by examining the
crystal structure of HLA-DR1 complexed with the SEB super-
antigen. This results in a collection of homogeneous peptides
within the MHC binding groove. The atomic positions of the
peptide backbone, as shown in the PDB file produced from the
crystal, were considered to be the 'representative' backbone
conformation of a peptide which binds to HLA-DR1.

Each of the peptide backbone conformations from the known MHC
class II crystallographic structures was transformed to the
same frame of reference as the 'representative' peptide, and
the differences between its Cα/Cβ positions and those of the
'representative' peptide were calculated. These differences
summarise the variability of Cα/Cβ atomic positions between
the known peptides and the 'representative' peptide.

The differences were doubled to take into account the fact
that the variability of peptides thus far crystallised may not
fully represent the true variability of peptides binding to
MHC class II molecules. The differences were then used to
define regions within which the centres of peptide Cα and Cβ
atoms are constrained to lie.



An exhaustive search was then made through candidate peptide
backbones. Starting from the 'representative' peptide,
candidates were generated by adjusting backbone φ and ψ angles
in ten degree steps from the N-terminus to the C-terminus.
An adjustment was rejected if it led to any Cα or Cβ atom
centre being outside the allowed region derived above. An
adjustment which did not violate the constraint resulted in a
new backbone conformation, which was stored within the peptide
backbone library.
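
The acceptance test used during this search might be sketched as follows. The shape of the allowed region is not specified in the text; the sketch assumes a sphere around each 'representative' Cα or Cβ position whose radius is twice the largest deviation observed among the known structures. The function names and the coordinate-list interface are hypothetical.

```python
import math


def allowed_radii(representative, observed_backbones, factor=2.0):
    """Radius of the allowed region for each constrained atom.

    representative:     list of (x, y, z) for the Ca/Cb atoms of the reference peptide
    observed_backbones: list of such lists, already in the same frame of reference
    factor:             the doubling applied to the observed differences
    """
    radii = []
    for index, reference in enumerate(representative):
        largest = max(math.dist(reference, backbone[index]) for backbone in observed_backbones)
        radii.append(factor * largest)
    return radii


def within_constraints(candidate, representative, radii):
    """Accept a candidate only if every Ca/Cb centre lies inside its allowed region."""
    return all(
        math.dist(atom, reference) <= radius
        for atom, reference, radius in zip(candidate, representative, radii)
    )
```

A candidate produced by a ten degree change in a φ or ψ angle would be added to the library only when `within_constraints` returns True.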



The x, y and z co-ordinates of atoms in the backbones
designated 0, 14, 62, 65, 75, 93, 104, 107, 112, 118, 129,
134, 141 and 144 are given in Tables 5 to 18.
Table 5

Backbone 0

Atom    Atom  Position    x        y        z
Number  type  in peptide
0       N     0           19.913   86.191   20.687
1       CA    0           19.472   86.222   22.078
2       C     0           18.153   85.531   22.516
3       O     0           18.200   84.640   23.352
4       CB    0           19.504   87.660   22.593
5       N     1           16.984   85.957   22.044
6       CA    1           15.771   85.316   22.536
7       C     1           15.262   84.115   21.770
8       O     1           15.175   84.127   20.547
9       CB    1           14.663   86.325   22.743
10      N     2           14.959   83.055   22.510
11      CA    2           14.414   81.829   21.926
12      C     2           12.920   82.131   21.907
13      O     2           12.384   82.737   22.840
14      CB    2           14.756   80.548   22.811
15      N     3           12.283   81.841   20.784
16      CA    3           10.866   82.097   20.637
17      C     3           10.086   80.785   20.839
18      O     3           10.560   79.730   20.447
19      CB    3           10.624   82.744   19.230
20      N     4           8.951    80.855   21.528
21      CA    4           8.035    79.734   21.814
22      C     4           6.945    79.658   20.721
23      O     4           6.664    80.648   20.044
24      CB    4           7.330    79.991   23.185
25      N     5           6.355    78.499   20.461
26      CA    5           5.266    78.527   19.496
27      C     5           4.167    78.292   20.475
28      O     5           4.342    77.560   21.444
29      CB    5           5.349    77.437   18.471
30      N     6           3.044    78.938   20.261
31      CA    6           1.950    78.858   21.205
32      C     6           1.050    77.758   20.856
33      O     6           0.836    77.517   19.690
34      CB    6           1.163    80.226   21.247
35      N     7           0.420    77.190   21.863
36      CA    7           -0.503   76.102   21.660
37      C     7           -1.889   76.607   21.227
38      O     7           -2.429   77.551   21.833
39      CB    7           -0.611   75.340   22.937
40      N     8           -2.442   75.997   20.167
41      CA    8           -3.790   76.330   19.644
Table 5 continued

Atom    Atom  Position
number  type  in peptide  x           y           z
42      C     8           -4.839      75.618      20.504
43      O     8           -4.505      74.687      21.236
44      CB    8           -3.924      75.908      18.149
45      N     9           -6.093      76.041      20.436
46      CA    9           -7.113      75.382      21.236
47      C     9           -7.976      74.424      20.403
48      O     9           -8.366      74.742      19.266
49      CB    9           -7.963      76.413      21.973
50      N     10          -8.203      73.232      20.971
51      CA    10          -8.995      72.149      20.365
52      C     10          -10.492     72.527      20.200
53      O     10          -10.962     73.563      20.702
54      CB    10          -8.830      70.835      21.191
55      N     11          -11.238     71.661      19.523
56      CA    11          -12.654     71.907      19.270
57      C     11          -13.603     71.483      20.395
58      O     11          -13.661     70.302      20.800
59      CB    11          -13.072     71.269      17.940
60      N     12          -14.360     72.481      20.852
61      CA    12          -15.363     72.337      21.898
62      C     12          -14.758     72.166      23.281
63      O     12          -14.785     71.069      23.853
64      CB    12          -16.320     71.168      21.577

Table 6

Backbone 14

Atom    Atom  Position
number  type  in peptide  x           y           z
0       N     0           0.000       0.000       0.000
1       CA    0           [illegible] [illegible] [illegible]
2       C     0           [illegible] [illegible] [illegible]
3       O     0           [illegible] [illegible] [illegible]
4       CB    0           0.000       0.000       0.000
5       N     1           [illegible] [illegible] [illegible]
6       CA    1           [illegible] [illegible] [illegible]
7       C     1           [illegible] [illegible] [illegible]
8       O     1           [illegible] [illegible] [illegible]
9       CB    1           [illegible] [illegible] [illegible]
10      N     2           [illegible] [illegible] 22.680
11      CA    2           [illegible] [illegible] [illegible]
12      C     2           [illegible] [illegible] 21.901
13      O     2           [illegible] [illegible] [illegible]
14      CB    2           [illegible] [illegible] [illegible]
15      N     3           [illegible] [illegible] [illegible]
16      CA    3           [illegible] [illegible] [illegible]
17      C     3           10.219      [illegible] [illegible]
18      O     3           10.898      79.694      [illegible]
19      CB    3           10.669      82.621      [illegible]
20      N     4           8.980       80.660      20.898
21      CA    4           8.245       79.430      21.010
22      C     4           6.863       79.586      20.344
23      O     4           6.283       80.680      20.413
24      CB    4           8.071       79.059      22.472
25      N     5           6.427       78.504      19.710
26      CA    5           5.135       78.479      19.082
27      C     5           4.084       77.942      20.074
28      O     5           4.171       76.770      20.468
29      CB    5           5.174       77.593      17.848
30      N     6           3.174       78.832      [illegible]
31      CA    6           2.100       78.470      [illegible]
32      C     6           1.349       [illegible] [illegible]
33      O     6           [illegible] [illegible] [illegible]
34      CB    6           1.139       79.635      21.492
35      N     7           0.381       [illegible] [illegible]
36      CA    7           -0.441      75.677      21.137
37      C     7           -1.906      76.139      21.008
38      O     7           -2.505      76.533      22.020
39      CB    7           -0.346      74.551      22.153
40      N     8           -2.392      76.101      19.773
41      CA    8           -3.758      76.454      19.498
42      C     8           -4.704      75.537      20.299
43      O     8           -4.316      74.404      20.618
44      CB    8           -4.043      76.313      18.013

Table 6 continued

Atom    Atom  Position
number  type  in peptide  x           y           z
45      N     9           -5.873      76.084      [illegible]
46      CA    9           -6.881      75.338      [illegible]
47      C     9           -7.500      74.285      [illegible]
48      O     9           -7.243      [illegible] [illegible]
49      CB    9           -7.964      [illegible] [illegible]
50      N     10          -8.250      73.379      [illegible]
51      CA    10          -8.934      [illegible] [illegible]
52      C     10          [illegible] [illegible] [illegible]
53      O     10          -11.075     73.192      20.928
54      CB    10          -8.914      71.043      20.996
55      N     11          -10.781     72.710      18.708
56      CA    11          -12.127     73.032      18.320
57      C     11          -13.058     71.846      18.640
58      O     11          -13.254     70.984      17.770
59      CB    11          -12.180     73.341      16.834
60      N     12          -13.551     71.844      19.872
61      CA    12          -14.474     70.830      20.305
62      C     12          0.000       -12.127     73.032
63      O     12          18.356      0.000       -12.127
64      CB    12          0.000       0.000       0.000

Table 7

Backbone 62

Atom    Atom  Position
number  type  in peptide  x           y           z
0       N     0           0.000       0.000       0.000
1       CA    0           18.315      [illegible] [illegible]
2       C     0           16.796      [illegible] [illegible]
3       O     0           16.173      87.867      [illegible]
4       CB    0           0.000       0.000       0.000
5       N     1           16.231      [illegible] [illegible]
6       CA    1           14.791      85.876      [illegible]
7       C     1           14.286      [illegible] [illegible]
8       O     1           13.659      [illegible] [illegible]
9       CB    1           14.132      87.193      [illegible]
10      N     2           14.595      [illegible] [illegible]
11      CA    2           14.144      [illegible] [illegible]
12      C     2           12.614      [illegible] [illegible]
13      O     2           [illegible] [illegible] [illegible]
14      CB    2           [illegible] [illegible] [illegible]
15      N     3           [illegible] [illegible] 20.960
16      CA    3           [illegible] [illegible] 20.629
17      C     3           [illegible] [illegible] 20.734
18      O     3           [illegible] [illegible] 20.783
19      CB    3           [illegible] [illegible] 19.218
20      N     4           [illegible] [illegible] 20.800
21      CA    4           [illegible] [illegible] 20.852
22      C     4           [illegible] [illegible] [illegible]
23      O     4           6.157       [illegible] [illegible]
24      CB    4           8.377       [illegible] [illegible]
25      N     5           6.490       [illegible] [illegible]
26      CA    5           5.140       78.440      [illegible]
27      C     5           4.171       78.141      [illegible]
28      O     5           4.543       77.392      [illegible]
29      CB    5           5.006       77.369      [illegible]
30      N     6           3.002       78.765      [illegible]
31      CA    6           1.975       78.549      [illegible]
32      C     6           1.039       77.416      20.577
33      O     6           1.276       76.842      19.503
34      CB    6           1.174       79.824      21.246
35      N     7           0.052       77.131      21.418
36      CA    7           -0.931      76.132      21.102
37      C     7           -2.325      76.784      21.008
38      O     7           -2.553      77.814      21.661
39      CB    7           -0.941      75.055      22.174
40      N     8           -3.166      76.177      20.179
41      CA    8           -4.518      76.638      20.020
42      C     8           -5.491      75.631      20.666
43      O     8           -5.155      74.441      20.754

Table 7 continued

Atom    Atom  Position
number  type  in peptide  x           y           z
44      CB    8           -4.845      76.793      18.545
45      N     9           -6.623      76.163      21.113
46      CA    9           -7.650      75.345      21.696
47      C     9           -3.161      74.329      20.655
48      O     9           -8.197      74.658      19.460
49      CB    9           -8.802      76.215      22.170
50      N     10          -8.492      73.143      21.153
51      CA    10          -9.030      72.107      20.315
52      C     10          -10.518     72.390      20.029
53      O     10          -11.258     72.730      20.964
54      CB    10          -8.887      70.758      21.000
55      N     11          -10.869     72.271      18.754
56      CA    11          -12.232     72.455      18.336
57      C     11          -13.047     71.182      18.641
58      O     11          -13.155     70.312      17.764
59      CB    11          -12.284     72.752      16.847
60      N     12          -13.544     71.124      19.871
61      CA    12          -14.366     70.022      20.291
62      C     12          0.000       -12.232     72.455
63      O     12          18.332      0.000       -12.232
64      CB    12          0.000       0.000       0.000

Table 8

Backbone 65

Atom    Atom  Position
number  type  in peptide  x           y           z
0       N     0           0.000       0.000       0.000
1       CA    0           18.487      86.641      22.418
2       C     0           16.990      86.870      22.533
3       O     0           16.510      87.999      22.287
4       CB    0           0.000       0.000       0.000
5       N     1           46.279      85.796      22.868
6       CA    1           14.844      85.866      23.065
7       C     1           14.178      84.664      22.417
8       O     1           13.234      84.830      21.612
9       CB    1           14.301      87.132      22.424
10      N     2           14.699      83.484      22.746
11      CA    2           14.144      82.241      22.248
12      C     2           12.616      82.381      22.089
13      O     2           11.950      82.822      23.038
14      CB    2           14.457      81.109      23.212
15      N     3           12.150      82.035      20.895
16      CA    3           10.742      82.065      20.608
17      C     3           10.206      80.624      20.484
18      O     3           10.895      79.773      19.902
19      CB    3           10.491      32.815      19.314
20      N     4           9.029       80.419      21.065
21      CA    4           8.376       79.140      20.993
22      C     4           6.930       79.322      20.491
23      O     4           6.309       80.350      20.801
24      CB    4           8.365       78.486      22.364
25      N     5           6.484       78.339      19.718
26      CA    5           5.139       78.340      19.212
27      C     5           4.150       78.069      20.363
28      O     5           4.487       77.306      21.280
29      CB    5           4.985       77.274      18.142
30      N     6           3.002       78.731      20.275
31      CA    6           1.959       78.547      21.246
32      C     6           0.861       77.634      20.665
33      O     6           0.752       77.533      19.433
34      CB    6           1.360       79.890      21.628
35      N     7           0.134       76.994      21.573
36      CA    7           -0.959      76.143      21.187
37      C     7           -1.983      76.952      20.366
38      O     7           -1.708      78.116      20.039
39      CB    7           -1.631      75.569      22.422
40      N     8           -3.087      76.287      20.048
41      CA    8           -4.156      76.921      19.326
42      C     8           -5.496      76.242      19.676

Table 8 continued

Atom    Atom  Position
number  type  in peptide  x           y           z
43      O     8           -6.146      [illegible] [illegible]
44      CB    8           -3.906      [illegible] [illegible]
45      N     9           -5.817      [illegible] [illegible]
46      CA    9           -7.058      75.736      [illegible]
47      C     9           -7.606      74.721      [illegible]
48      O     9           -7.311      74.855      [illegible]
49      CB    9           -8.071      [illegible] [illegible]
50      N     10          -8.339      73.746      [illegible]
51      CA    10          -8.959      72.751      20.108
52      C     10          -10.421     73.147      19.824
53      O     10          -10.685     73.773      18.787
54      CB    10          -8.919      71.398      20.799
55      N     11          -11.294     72.734      20.735
56      CA    11          -12.689     73.067      20.635
57      C     11          [illegible] [illegible] [illegible]
58      O     11          -13.031     71.253      19.099
59      CB    11          -12.873     74.262      19.715
60      N     12          -14.572     71.556      20.766
61      CA    12          -15.436     70.486      20.348
62      C     12          0.000       -12.689     73.067
63      O     12          18.675      0.000       -12.689
64      CB    12          0.000       0.000       0.000

Table 9

Backbone 75

Atom    Atom  Position
number  type  in peptide  x           y           z
0       N     0           0.000       0.000       0.000
1       CA    0           18.442      86.539      22.377
2       C     0           16.947      86.419      22.136
3       O     0           16.452      86.839      21.066
4       CB    0           0.000       0.000       0.000
5       N     1           16.265      85.822      23.109
6       CA    1           14.823      85.676      23.048
7       C     1           14.466      84.417      22.277
8       O     1           14.197      84.487      21.057
9       CB    1           14.218      86.875      22.338
10      N     2           14.505      83.290      22.985
11      CA    2           14.144      82.013      22.404
12      C     2           12.615      81.942      22.214
13      O     2           11.895      81.727      23.200
14      CB    2           14.601      80.882      23.308
15      N     3           12.201      82.159      20.971
16      CA    3           10.808      82.078      20.626
17      C     3           10.331      80.615      20.726
18      O     3           11.176      79.709      20.772
19      CB    3           10.592      82.592      19.213
20      N     4           9.013       80.465      20.789
21      CA    4           8.414       79.160      20.836
22      C     4           6.944       79.245      20.377
23      O     4           6.322       80.304      20.544
24      CB    4           8.478       78.609      22.251
25      N     5           6.482       78.145      19.793
26      CA    5           5.116       78.053      19.354
27      C     5           4.181       77.969      20.577
28      O     5           4.609       77.470      21.629
29      CB    5           4.932       76.823      18.483
30      N     6           2.974       78.490      20.389
31      CA    6           1.974       78.445      21.420
32      C     6           0.736       77.679      20.910
33      O     6           0.349       77.867      19.748
34      CB    6           1.576       79.855      21.821
35      N     7           0.206       76.836      21.788
36      CA    7           -0.980      76.086      21.478
37      C     7           -1.844      76.872      20.470
38      O     7           -1.448      77.977      20.071
39      CB    7           -1.778      75.828      22.745
40      N     8           -2.952      76.249      20.088
41      CA    8           -3.885      76.873      19.189

Table 9 continued

Atom    Atom  Position
number  type  in peptide  x           y           z
42      C     8           -5.324      76.483      19.579
43      O     8           -6.195      76.435      18.698
44      CB    8           -3.604      76.435      17.762
45      N     9           -5.491      76.194      [illegible]
46      CA    9           -6.786      75.859      [illegible]
47      C     9           -7.424      74.747      [illegible]
48      O     9           -7.209      74.729      [illegible]
49      CB    9           -7.681      77.087      [illegible]
50      N     10          -8.142      73.864      [illegible]
51      CA    10          -8.840      72.797      [illegible]
52      C     10          -10.312     73.196      [illegible]
53      O     10          [illegible] [illegible] [illegible]
54      CB    10          -8.772      71.532      21.394
55      N     11          -11.149     [illegible] 21.275
56      CA    11          -12.546     73.108      21.233
57      C     11          -13.321     72.011      20.475
58      O     11          -12.815     71.509      19.460
59      CB    11          -12.741     74.445      20.540
60      N     12          -14.483     71.674      21.023
61      CA    12          -15.343     70.702      20.406
62      C     12          0.000       -12.546     73.108
63      O     12          18.817      0.000       -12.546
64      CB    12          0.000       0.000       0.000

Table 10

Backbone 93

Atom    Atom  Position
number  type  in peptide  x           y           z
0       N     0           0.000       0.000       0.000
1       CA    0           18.249      86.312      21.629
2       C     0           16.910      86.341      22.345
3       O     0           16.646      87.271      23.139
4       CB    0           0.000       0.000       0.000
5       N     1           16.080      85.351      22.027
6       CA    1           14.782      85.213      22.662
7       C     1           14.078      83.978      22.127
8       O     1           12.999      84.095      21.505
9       CB    1           13.932      86.434      22.357
10      N     2           14.712      82.828      22.345
11      CA    2           14.144      81.558      21.938
12      C     2           12.613      81.689      21.812
13      O     2           11.912      81.568      22.828
14      CB    2           14.484      80.486      22.959
15      N     3           12.179      81.964      20.587
16      CA    3           10.775      82.068      20.300
17      C     3           10.163      80.658      20.176
18      O     3           10.712      79.826      19.439
19      CB    3           10.564      82.834      19.005
20      N     4           9.085       80.454      20.925
21      CA    4           8.374       79.206      20.882
22      C     4           7.026       79.401      20.159
23      O     4           6.568       80.546      20.036
24      CB    4           8.130       78.697      22.292
25      N     5           6.482       78.283      19.690
26      CA    5           5.203       78.295      19.035
27      C     5           4.087       78.033      20.066
28      O     5           4.298       77.235      20.991
29      CB    5           5.163       77.229      17.954
30      N     6           2.980       78.741      19.876
31      CA    6           1.833       78.572      20.726
32      C     6           1.164       77.213      20.434
33      O     6           1.603       76.513      19.510
34      CB    6           [illegible] 79.695      20.486
35      N     7           0.169       76.899      21.254
36      CA    7           -0.585      75.687      21.080
37      C     7           -2.092      76.013      21.037
38      O     7           -2.667      76.338      22.086
39      CB    7           -0.300      74.729      22.223
40      N     8           -2.639      75.944      19.829
41      CA    8           -4.045      76.173      19.635
42      C     8           -4.853      75.344      20.653
43      O     8           -4.314      74.368      21.198







WO 98/59244 



PCT/GB98/01801 






Table 10 continued

Atom    Atom  Position
number  type  in peptide  x           y           z
44      CB    8           -4.445      75.782      18.223
45      N     9           -6.082      75.791      20.882
46      CA    9           -6.974      75.097      21.769
47      C     9           -8.018      74.312      20.948
48      O     9           -8.754      74.928      20.163
49      CB    9           -7.679      76.089      22.679
50      N     10          -8.002      72.999      21.144
51      CA    10          -8.947      72.137      20.488
52      C     10          -10.274     72.891      20.269
53      O     10          -10.348     73.727      19.356
54      CB    10          -9.194      70.899      21.332
55      N     11          -11.256     72.533      21.087
56      CA    11          -12.539     73.179      21.038
57      C     11          -13.542     72.288      20.278
58      O     11          -13.224     71.836      19.167
59      CB    11          -12.418     74.524      20.343
60      N     12          -14.678     72.054      20.925
61      CA    12          -15.731     71.281      20.326
62      C     12          0.000       -12.539     73.179
63      O     12          18.616      0.000       -12.539
64      CB    12          0.000       0.000       0.000

Table 11

Backbone 104

Atom    Atom  Position
number  type  in peptide  x           y           z
0       N     0           0.000       0.000       0.000
1       CA    0           18.400      86.585      22.355
2       C     0           16.914      86.850      22.523
3       O     0           16.453      87.991      22.296
4       CB    0           0.000       0.000       0.000
5       N     1           16.189      85.793      22.880
6       CA    1           14.763      85.897      23.128
7       C     1           14.059      84.662      22.593
8       O     1           12.980      84.778      21.971
9       CB    1           14.210      87.122      22.421
10      N     2           14.693      83.511      22.810
11      CA    2           14.125      82.241      22.404
12      C     2           12.594      82.372      22.277
13      O     2           11.945      82.807      23.241
14      CB    2           14.465      81.169      23.424
15      N     3           12.104      82.026      21.093
16      CA    3           10.690      82.048      20.837
17      C     3           10.159      80.604      20.723
18      O     3           10.919      79.713      20.317
19      CB    3           10.406      82.801      19.548
20      N     4           8.902       80.444      21.120
21      CA    4           8.250       79.166      21.029
22      C     4           6.905       79.319      20.290
23      O     4           6.415       80.450      20.160
24      CB    4           8.009       78.605      22.420
25      N     5           6.401       78.185      19.817
26      CA    5           5.130       78.158      19.147
27      C     5           4.011       77.862      20.165
28      O     5           4.164       76.935      20.975
29      CB    5           5.135       77.091      18.066
30      N     6           2.968       78.680      20.096
31      CA    6           1.823       78.502      20.947
32      C     6           1.166       77.138      20.656
33      O     6           1.718       76.360      19.864
34      CB    6           0.819       79.617      20.708
35      N     7           0.047       76.906      21.334
36      CA    7           -0.707      75.699      21.135
37      C     7           -2.213      76.030      21.083
38      O     7           -2.793      76.357      22.129
39      CB    7           -0.435      74.724      22.267
40      N     8           -2.754      75.961      19.873
41      CA    8           -4.157      76.194      19.670
42      C     8           -4.974      75.368      20.684
43      O     8           -4.444      74.387      21.228

Table 11 continued

Atom    Atom  Position
number  type  in peptide  x           y           z
44      CB    8           [illegible] [illegible] 18.256
45      N     9           [illegible] [illegible] 20.911
46      CA    9           [illegible] [illegible] 21.794
47      C     9           [illegible] [illegible] 20.969
48      O     9           -8.997      74.991      20.328
49      CB    9           -7.800      76.129      22.704
50      N     10          -8.007      73.038      21.000
51      CA    10          -8.934      72.175      20.320
52      C     10          -10.266     72.919      20.092
53      O     10          -10.341     73.752      19.177
54      CB    10          -9.181      70.924      21.145
55      N     11          -11.249     72.557      20.907
56      CA    11          -12.537     73.194      20.850
57      C     11          -13.529     72.294      20.086
58      O     11          -13.514     72.297      18.847
59      CB    11          -12.421     74.537      20.152
60      N     12          -14.310     71.549      20.860
61      CA    12          -15.320     70.695      20.297
62      C     12          0.000       -12.537     73.194
63      O     12          18.422      0.000       -12.537
64      CB    12          0.000       0.000       0.000

Table 12

Backbone 107

Atom    Atom  Position
number  type  in peptide  x           y           z
0       N     0           0.000       0.000       0.000
1       CA    0           18.468      86.641      22.418
2       C     0           16.971      86.870      22.533
3       O     0           16.491      87.999      22.287
4       CB    0           0.000       0.000       0.000
5       N     1           16.260      85.796      22.868
6       CA    1           14.825      85.866      23.065
7       C     1           14.159      84.664      22.417
8       O     1           13.215      84.830      21.612
9       CB    1           14.282      87.132      22.424
10      N     2           14.680      83.484      22.746
11      CA    2           14.125      82.241      22.248
12      C     2           12.597      82.381      22.089
13      O     2           11.931      82.822      23.038
14      CB    2           14.438      81.109      23.212
15      N     3           12.131      82.035      20.895
16      CA    3           10.723      82.065      20.608
17      C     3           10.187      80.624      20.484
18      O     3           10.876      79.773      19.902
19      CB    3           10.472      82.818      19.314
20      N     4           9.010       80.419      21.065
21      CA    4           8.357       79.140      20.993
22      C     4           6.911       79.322      20.491
23      O     4           6.290       80.350      20.801
24      CB    4           8.346       78.486      22.364
25      N     5           6.465       78.339      19.718
26      CA    5           5.120       78.340      19.212
27      C     5           4.131       78.069      20.363
28      O     5           4.469       77.306      21.280
29      CB    5           4.966       77.274      18.142
30      N     6           2.983       78.731      20.275
31      CA    6           1.940       78.547      21.246
32      C     6           0.842       77.634      20.665
33      O     6           0.733       77.533      19.433
34      CB    6           1.341       79.890      21.628
35      N     7           0.115       76.994      21.573
36      CA    7           -0.978      76.143      21.187
37      C     7           -2.002      76.952      20.366
38      O     7           -1.726      78.116      20.039
39      CB    7           -1.650      75.569      22.422
40      N     8           -3.106      76.287      20.048
41      CA    8           -4.175      76.921      19.326
42      C     8           -5.514      76.242      19.676
43      O     8           -6.165      75.692      18.775

Table 12 continued

Atom    Atom  Position
number  type  in peptide  x           y           z
44      CB    8           -3.925      76.820      17.831
45      N     9           -5.836      76.283      20.964
46      CA    9           -7.077      75.736      21.439
47      C     9           -7.625      74.721      20.416
48      O     9           -7.330      74.855      19.219
49      CB    9           -8.090      76.849      21.649
50      N     10          -8.358      73.746      20.940
51      CA    10          -8.977      72.751      20.108
52      C     10          -10.440     73.147      19.824
53      O     10          -10.703     73.773      18.787
54      CB    10          -8.938      71.398      20.799
55      N     11          -11.313     72.734      20.735
56      CA    11          -12.708     73.067      20.635
57      C     11          -13.493     71.860      20.085
58      O     11          -13.050     71.253      19.099
59      CB    11          -12.892     74.262      19.715
60      N     12          -14.591     71.556      20.766
61      CA    12          -15.455     70.486      20.348
62      C     12          0.000       -12.708     73.067
63      O     12          18.675      0.000       -12.708
64      CB    12          0.000       0.000       0.000

Table 13

Backbone 112

Atom    Atom  Position
number  type  in peptide  x           y           z
0       N     0           0.000       0.000       0.000
1       CA    0           18.408      86.726      22.399
2       C     0           16.919      86.606      22.121
3       O     0           16.449      87.028      21.041
4       CB    0           0.000       0.000       0.000
5       N     1           16.215      86.005      23.077
6       CA    1           14.774      85.858      22.981
7       C     1           14.438      84.649      22.125
8       O     1           14.190      84.795      20.907
9       CB    1           14.176      87.097      22.337
10      N     2           14.470      83.480      22.761
11      CA    2           14.125      82.241      22.093
12      C     2           12.600      82.176      21.872
13      O     2           11.849      82.152      22.858
14      CB    2           14.572      81.057      22.932
15      N     3           12.224      82.187      20.598
16      CA    3           10.839      [illegible] 20.230
17      C     3           10.319      80.669      20.557
18      O     3           11.133      79.744      20.692
19      CB    3           10.674      [illegible] 18.745
20      N     4           9.001       [illegible] 20.701
21      CA    4           8.361       [illegible] 20.960
22      C     4           6.868       [illegible] 20.585
23      O     4           6.126       [illegible] 21.239
24      CB    4           8.500       [illegible] 22.429
25      N     5           6.516       78.676      19.537
26      CA    5           5.150       78.615      19.095
27      C     5           4.229       78.301      20.291
28      O     5           4.706       77.734      21.285
29      CB    5           4.995       77.540      18.033
30      N     6           2.976       78.716      20.149
31      CA    6           1.986       78.455      21.158
32      C     6           0.948       77.449      20.621
33      O     6           1.060       77.031      19.459
34      CB    6           1.291       79.747      21.552
35      N     7           0.020       77.088      21.499
36      CA    7           -1.045      76.194      [illegible]
37      C     7           -2.219      76.999      20.540
38      O     7           -2.062      78.205      20.301
39      CB    7           -1.517      75.422      22.353
40      N     8           -3.314      76.286      20.301
41      CA    8           -4.508      76.904      19.793
42      C     8           -5.720      75.987      20.056
43      O     8           -5.881      74.984      19.345
44      CB    8           -4.369      77.156      18.302
45      N     9           -6.483      76.357      21.078

Table 13 continued

Atom    Atom  Position
number  type  in peptide  x           y           z
46      CA    9           -7.676      75.631      21.417
47      C     9           -7.858      74.446      20.447
48      O     9           -7.297      74.482      19.341
49      CB    9           -8.883      76.549      21.338
50      N     10          -8.598      73.451      20.920
51      CA    10          -8.898      72.298      20.116
52      C     10          -10.415     72.236      19.842
53      O     10          -11.204     72.400      20.784
54      CB    10          -8.455      71.034      20.832
55      N     11          -10.740     72.040      18.569
56      CA    11          -12.112     71.910      18.163
57      C     11          -12.689     70.583      18.695
58      O     11          -12.384     69.523      18.128
59      CB    11          [illegible] 71.942      16.648
60      N     12          -13.459     70.705      19.770
61      CA    12          -14.109     69.563      20.354
62      C     12          0.000       -12.112     71.910
63      O     12          18.708      0.000       -12.112
64      CB    12          0.000       0.000       0.000

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nacKDone xx 


8 


Atom 


Atom 


Position 


X 


y 


z 




type 


x 11 pepume 








U 


N 


0 


0 000 


o nnn 


n nnn 


X 


CA 


0 


18.471 


8 6 S16 

O U • J JO 


9 9 4 fi7 
ZZ • 4 U / 


Z 


c 


0 


16. 968 


86 701 

<-> U « / V/ X 


99 9 66 
ZZ • ZOO 


*3 


0 


0 


16.498 


87 742 


91 7^^ 

Z X * t ijZJ 




CB 


0 


0.000 


0 000 


o nnn 




N 


1 


16.246 


85 665 


99 6ft 6 
ZZ • o o o 


£ 
O 


CA 


1 


14 .795 


85 690 


22 661 

*C Z . UUJ 


7 


C 


1 


14 .271 


84 . 435 


21 386 

Z X • -7 U U 


ft 

u 


0 


1 


13. 620 


84 525 


90 Q9? 

Z V/ . -7 Z Z 


q 


CB 


1 


14 .318 


86. 904 


2 1 RP.4 

Z X • OOn 


1 0 


N 


2 


14 .591 


83 292 


99 Q 

Z Z • JU -7 


1 1 

X X 


CA 


2 


14 . 125 


82 01 ? 


99 noi 

Z Z • J 


xz 


C 


2 


12 591 


89 04 ^ 

O Z . \JH D 


91 Q14 
z X . Z7 J fi 


1 ^ 
x o 


0 


2 


11 881 

-A. -A- . O O X 


ft2 067 


99 Q^l 

ZZ . yji 


1 4 

X fx 


CB 


2 


14 518 


RO Q07 


9 7 n^7 


X D 


N 


3 


X £ . XQJ 


R 9 HQ 1 
oz. UoX 


o n ann 
zU . O / / 


1 6 

X u 


CA 


3 


10 76? 

X v . / DZ 


ft 9 

OZ* uo4 


on "3 
zu . Job 


X / 


c 


3 


■LU.ZZl 


OU • OZ J 


OA yl "7 Q 

zU . 4/9 


1 ft 
XO 


0 


3 


XX . UUj 


"7 Q .CO /I 

/ y . o /4 


on o x o 

20. 343 


X y 


CB 


3 


X U . D J O 


OO coo 
OZ. ODD 


1 O QCO 


on 
zU 


N 


4 


ft Q 9 R 
O • 7Z J 


on ci A 1 


on o c ^ 
zu. 7 5 6 


z X 


CA 


4 


8 9 6** 

O • Z D j 


7 Q 9 CQ 


OH Q / c: 
ZU . o4 O 




C 


! 4 


6 R7Q 

U a O / 


7 Q ^ t^9 
/ y . jjZ 


on 1*71 
zU . x / X 


ZJ 


o 


I 4 


O • jZ j 




■on A"7A 

20 . 070 


9 4 
Zf* 


CB 


4 


8 101 

o . X v X 


7 P ft 6Q 
/ O • O DO 


o o *5 n i 
zz • JUX 


O 


N 


5 


6.413 


7fl 1 
/ O • X j D 


1 Q 7 I £ 
17. /ID 


26 
z u 


CA 


5 


1 5,115 


7 8 10^ 


i a i n 6 

17* X U O 


27 


C 


5 


4 . 061 


77 7SS 


20 1 77 

Z U • X / f 


28 


0 


5 


4 . 217 


76 717 


20 ft 66 
Z U . 0 o o 


29 


CB 


5 


5. 122 


11 034 


18 n97 
X o . u z / 


30 


N 


6 


3 . 069 


78 632 


90 9ft 9 
Z U • Z 0 z 


31 


CA 


6 


1.984 


78 . 421 


21 202 
Z X • z u z 


32 


C 


6 


1.060 


77 308 


20 67n 
z u • u / u 


33 


. 0 


6 


1.327 


76.771 


1 Q Sft 4 

X 7 . JO 4 


34 


CB 


6 


1.192 


79.706 


21.374 


35 


N 
r4 


7 


0. 048 


76. 997 


21.472 


36 


CA 


7 


-0.928 


76.012 


21.093 


37 


C 


7 


-2.316 


76.673 


20.976 


38 


0 


7 


-2.546 


77.708 


21. 619 


39 


CB 


7 


-0.975 


74.902 


22.128 


40 


N 


8 


-3.150 


76.066 


20.139 


41 


CA 


8 


-4.496 


76.535 


19.959 


! 42 


C 


8 


-5.484 


75.538 


20.596 


43 


0 


8 


-5.163 


74.343 


20.680 


44 


CB 


8 


-4.801 


76.684 


18.479 
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Table 14 continued 



Atom    Atom  Position              x            y            z
number  type  in peptide

45      N     9                -6.612       76.081       21.040
46      CA    9                -7.652       75.273       21.615
47      C     9                -8.169       74.268       20.567
48      O     9           [illegible]       74.604       19.374
49      CB    9           [illegible]       76.156       22.087
50      N     10          [illegible]       73.083       21.059
51      CA    10          [illegible]       72.056       20.214
52      C     10          [illegible]       72.355       19.925
53      O     10          [illegible]       72.703       20.859
54      CB    10          [illegible]  [illegible]       20.892
55      N     11          [illegible]       72.239       18.649
56      CA    11              -12.254       72.439       18.229
57      C     11              -13.135       71.287       18.754
58      O     11          [illegible]       70.187       18.183
59      CB    11              -12.328       72.490       16.713
60      N     12              -13.856       71.586       19.828
61      CA    12              -14.763       70.632       20.406
62      C     12                0.000      -12.254       72.439
63      O     12               18.754        0.000      -12.254
64      CB    12                0.000        0.000        0.000
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Table 15 



Backbone 129

Atom    Atom  Position              x            y            z
number  type  in peptide

0       N     0                 0.000        0.000        0.000
1       CA    0                18.495       86.291       22.091
2       C     0                17.099       86.364       22.686
3       O     0                16.668       87.449       23.137
4       CB    0                 0.000        0.000        0.000
5       N     1                16.409       85.228       22.645
6       CA    1                15.079       85.125       23.217
7       C     1                14.331       83.972       22.570
8       O     1                13.400       84.204       21.766
9       CB    1                14.313       86.412       22.964
10      N     2                14.767       82.758       22.900
11      CA    2                14.125       81.558       22.404
12      C     2                12.611       81.805       22.245
13      O     2                11.911       81.927       23.261
14      CB    2                14.358       80.407       23.367
15      N     3                12.194       81.901       20.988
16      CA    3                10.803       82.082       20.676
17      C     3                10.173       80.727       20.297
18      O     3                10.650       80.085       19.349
19      CB    3                10.652       83.058       19.522
20      N     4                 9.165       80.348       21.074
21      CA    4                 8.445       79.131       20.819
22      C     4                 7.047       79.462       20.257
23      O     4                 6.608       80.615       20.376
24      CB    4                 8.305       78.330       22.102
25      N     5                 6.442       78.450       19.647
26      CA    5                 5.114       78.588       19.113
27      C     5                 4.079       78.178       20.180
28      O     5                 4.373       77.289       20.993
29      CB    5                 4.955       77.714       17.881
30      N     6                 2.945       78.866       20.145
31      CA    6                 1.864       78.568       21.044
32      C     6                 1.193       77.243       20.630
33      O     6                 1.658       76.606       19.673
34      CB    6                 0.841       79.690       21.018
35      N     7                 0.165       76.881       21.388
36      CA    7                -0.594       75.695       21.099
37      C     7                -2.093       76.044       21.014
38      O     7                -2.691       76.384       22.046
39      CB    7                -0.369       74.657       22.184
40      N     8                -2.610       75.977       19.793
41      CA    8                -4.006       76.226       19.560
42      C     8                -4.854       75.414       20.559
43      O     8                -4.305       74.533       21.237
44      CB    8                -4.374       75.835       18.139
45      N     9                -6.130       75.774       20.624
46      CA    9                -7.058       75.079       21.473
47      C     9                -8.093       74.330       20.610
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Table 15 continued 



Atom    Atom  Position              x            y            z
number  type  in peptide

48      O     9           [illegible]  [illegible]       19.819
49      CB    9           [illegible]  [illegible]       22.384
50      N     10               -8.107  [illegible]  [illegible]
51      CA    10               -9.049  [illegible]  [illegible]
52      C     10              -10.358  [illegible]  [illegible]
53      O     10              -10.355  [illegible]  [illegible]
54      CB    10               -9.337  [illegible]  [illegible]
55      N     11              -11.409       72.493  [illegible]
56      CA    11              -12.689       73.142       20.432
57      C     11              -13.742       72.155       19.889
58      O     11              -13.537       71.595       18.802
59      CB    11              -12.603       74.353       19.519
60      N     12              -14.788       71.968       20.684
61      CA    12              -15.877       71.114       20.295
62      C     12                0.000      -12.689       73.142
63      O     12               18.488        0.000      -12.689
64      CB    12                0.000        0.000        0.000
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Table 16 





4 


Atom 


Atom 


Position 


x 


v 


d 


iNuHLDeiv 


type 


in peptide 








0 


N 


0 


0.000 


0 000 




1 


CA 


o 


19.230 


86. 312 


21 629 

Cm X • SJC. m* 


o 
z 


r 


o 


16 891 


86 341 


29 34 ^ 
ZZ • J 4 O 






o 


16 627 

W . W Cm I 


87 271 

O / . Cm I X 


zo • ij y 




CB 


o 


0 000 


n nnn 

U • uuu 


n (inn 

Ut UUU 


0 


N 


1 

X 


16 061 

X V • V U X 


O O • O 0 1 


9 9 no*7 
ZZ . UZ / 


0 


CA 


1 


1*4 7 63 


P S 21 ^ 
o o • z 10 


99 ^^9 
ZZ • ooz 


7 

/ 


c 


1 

X 


14 0S9 


O j . D / o 


99 1 9"7 
ZZ . 1Z / 


Q 

0 




1 

X 


12 9P0 

X 4. • _7 O U 


q / not; 
o 4 • u y o 


91 cnt; 
Zl . OUO 


Q 

y 




1 


13 91 3 


0 D • 4 J4 


9 9 *3 c."7 
ZZ .00/ 


i n 


N 


2 

Cm 


14 6Q3 

xl • D«7 J 


O Z . OZo 


9 9 *5 yl c. 
ZZ . o 4 O 


i 1 
IX 


CA 


2 

Cm 


x4 . 1Z J 


PI c: c q 
0 1. 00 o 


zi . yoo 


1Z 


c 


2 

Cm 


12 SQ4 


P 1 Q 

oi. do y 


91 Q 1 9 

Zl • olz 


1 ^ 
1j 


o 


2 


11 P 

1 1 • 0 J j 


01 • ODD 


99 QOQ 
ZZ . OZO 


14 


CB 


o 

Cm 


14 4 £^ 
14 • 4 O 3 


Q f\ AQ £ 
O U . 4 o b 


zz . yoy 


10 


IN 


9 
O 


10 1 £Pl 
x Z . 1 ou 


ol . y o4 


20.587 


lb 


CA 


O 

o 


XU . / O O 


Oct • Uoo 


20 . 300 


1 9/ 




9 
O 


in i a a 

1U . 14 4 


on £T CO 

oU . boo 


On 1 ~7 r" 

20.176 






0 


lu • oyj 


7 9 . 826 


19 . 439 


1 Q 

iy 






i n r 
1U • 04 3 


O O O O A 

oz . oo4 


in n n c 

19 . 005 


o a 
20 


IN 


ft 




O O A C A 

ou . 454 


n n n n c 

20 . 925 






A 
4 


O . ODD 


7 9.206 


n n n n n 

20 . 882 


o o 
22 


c 


4 


i nn"7 

/ . UU / 


/ y . h ui 


OA 1 C A 

20 . loy 


9 'J 

2o 


o 


4 


£ Q / Q 

o . 04 y 




on n o £. 


9 A 
Z4 




4 


p ill 

O * 111 


TO C0"7 

/ o . oy / 


on O Q O 

22.292 


9 


N 


o 


6 4 6*3 


IP 9 P "3 
/ O . Z O O 


i q con 

i y . oy u 


9 £ 

z o 


CA 


5 

<■»» 


5 184 


7P 2Q^ 

f 0 m Z -7 0 


i q n ^ c; 
l y . uoo 


99 
Z / 


c 


5 


4 . 068 


78 033 


9 n n a £ 


28 


0 


5 


4 .279 


77 .235 


20 9 91 


29 


CB 


5 


5. 144 


77 .229 


1 7 9S4 ! 


30 


N 


6 


2. 961 


78 741 


1 9 P7 n ' 


31 


CA 


6 


1. 814 


78 . 572 


20 726 

Cm \J • I Cm \J 


32 


C 


6 


1. 146 


77 .213 


20 4 34 

«. U • 4 J 4 


33 


0 


6 


1.584 


76 513 


19 sin 

J- -7 . J X U 


34 


CB 


6 


0.820 


79 695 


20 4 P 6 

U • 4 O O 


35 


N 


7 


0.150 


76 899 


91 9^4 
Z 1 • Z O 4 


36 


CA 


7 


-0.604 


75.687 


21.080 


37 


c 


7 


-2.110 


76 013 


21 on 

Z x • UO / 


38 


0 


7 


-2.686 


76.338 


22.086 


39 


CB 


7 


-0.319 


74.729 


22.223 ; 


40 


N 


8 


-2.658 


75.944 


19.829 


41 


CA 


8 


-4.064 


76.173 


19.635 


42 


C 


8 


-4.872 


75.344 


20.653 


43 


0 


8 


-4.333 


74.368 


21.198 


44 


CB 


8 


-4.463 


75.782 


18.223 


45 


N 


9 


-6.101 


75.791 


20.882 


46 


CA 


9 


-6..993 


75.097 


21.769 
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Table 16 continued 



Atom    Atom  Position              x            y            z
number  type  in peptide

47      C     9                -8.036       74.312       20.948
48      O     9                -8.773       74.928       20.163
49      CB    9                -7.698       76.089       22.679
50      N     10               -8.021       72.999       21.144
51      CA    10               -8.966       72.137       20.488
52      C     10              -10.293       72.891       20.269
53      O     10              -10.367       73.727       19.356
54      CB    10               -9.213       70.899       21.332
55      N     11              -11.275       72.533       21.087
56      CA    11              -12.558       73.179       21.038
57      C     11              -13.561       72.288       20.278
58      O     11              -13.243       71.836       19.167
59      CB    11              -12.437       74.524       20.343
60      N     12              -14.696       72.054       20.925
61      CA    12              -15.750       71.281       20.326
62      C     12                0.000      -12.558       73.179
63      O     12               18.616        0.000      -12.558
64      CB    12                0.000        0.000        0.000
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Table 17 



Backbone 141

Atom    Atom  Position              x            y            z
number  type  in peptide

0       N     0                 0.000        0.000        0.000
1       CA    0                18.454       86.485       22.460
2       C     0                16.950       86.573       22.266
3       O     0                16.481       87.224       21.305
4       CB    0                 0.000        0.000        0.000
5       N     1                16.227       85.893       23.151
6       CA    1               [?].776       85.918       23.128
7       C     1                14.252       84.663       22.452
8       O     1                13.601       84.752       21.387
9       CB    1                14.299       87.132       22.349
10      N     2                14.573       83.520       23.055
11      CA    2                14.106       82.241       22.559
12      C     2                12.572       82.273       22.400
13      O     2                11.868       82.483       23.398
14      CB    2                14.499       81.135       23.523
15      N     3                12.141       82.099       21.156
16      CA    3                10.736       82.054       20.855
17      C     3                10.224       80.605       20.973
18      O     3                11.035       79.698       21.214
19      CB    3                10.489       82.573       19.449
20      N     4                 8.911       80.468       20.833
21      CA    4                 8.289       79.172       20.868
22      C     4                 6.823       79.286       20.405
23      O     4                 6.108       80.179       20.882
24      CB    4                 8.338       78.611       22.279
25      N     5                 6.465       78.404       19.478
26      CA    5                 5.118       78.352       18.981
27      C     5                 4.147       78.042       20.138
28      O     5                 4.521       77.295       21.054
29      CB    5                 4.999       77.280       17.911
30      N     6                 2.972       78.656       20.055
31      CA    6                 1.943       78.430       21.033
32      C     6                 1.020       77.288       20.562
33      O     6                 1.265       76.719       19.488
34      CB    6                 1.130       79.697       21.234
35      N     7                 0.034       76.991       21.401
36      CA    7                -0.938       75.983       21.081
37      C     7                -2.338       76.622       20.985
38      O     7                -2.577       77.649       21.637
39      CB    7                -0.939       74.903       22.150
40      N     8                -3.173       76.006       20.156
41      CA    8                -4.529       76.453       19.995
42      C     8                -5.492       75.437       20.641
43      O     8                -5.144       74.250       20.729
44      CB    8                -4.856       76.604       18.520
45      N     9                -6.629       75.957       21.087
46      CA    9                -7.649       75.129       21.670
47      C     9                -7.625       73.734       21.014
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Table 17 continued 



Atom    Atom  Position              x            y            z
number  type  in peptide

48      O     9           [illegible]  [illegible]  [illegible]
49      CB    9           [illegible]  [illegible]  [illegible]
50      N     10          [illegible]  [illegible]  [illegible]
51      CA    10          [illegible]  [illegible]  [illegible]
52      C     10          [illegible]  [illegible]  [illegible]
53      O     10          [illegible]  [illegible]       20.788
54      CB    10          [illegible]       70.836       21.005
55      N     11          [illegible]       72.148       18.840
56      CA    11              -12.353       71.910       18.476
57      C     11          [illegible]  [illegible]       18.805
58      O     11              -12.400       69.551       18.020
59      CB    11              -12.548       72.168       16.992
60      N     12              -13.373       70.294       19.958
61      CA    12              -13.836       69.000       20.380
62      C     12                0.000      -12.353       71.910
63      O     12               18.541        0.000      -12.353
64      CB    12                0.000        0.000        0.000
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Table 18 
Backbone 144

Atom    Atom  Position              x            y            z
number  type  in peptide

0       N     0                 0.000        0.000        0.000
1       CA    0                18.480       86.428       22.392
2       C     0                16.967       86.551       22.343
3       O     0                16.431       87.361       21.553
4       CB    0                 0.000        0.000        0.000
5       N     1                16.308       85.727       23.153
6       CA    1                14.861       85.759       23.256
7       C     1                14.262       84.643       22.416
8       O     1                13.512       84.919       21.454
9       CB    1                14.341       87.091       22.745
10      N     2                14.630       83.412       22.767
11      CA    2                14.106       82.241       22.093
12      C     2                12.565       82.287       22.092
13      O     2                11.968       82.501       23.158
14      CB    2                14.581       80.981       22.796
15      N     3                12.006       82.121       20.899
16      CA    3                10.578       82.090       20.743
17      C     3                10.094       80.628       20.667
18      O     3                10.880       79.754       20.273
19      CB    3                10.177       82.830       19.479
20      N     4                 8.846       80.435       21.077
21      CA    4                 8.236       79.135       21.020
22      C     4                 6.879       79.228       20.292
23      O     4                 6.338       80.337       20.167
24      CB    4                 8.027       78.596       22.424
25      N     5                 6.422       78.073       19.822
26      CA    5                 5.148       77.990       19.162
27      C     5                 4.052       77.645       20.190
28      O     5                 4.068       76.532       20.737
29      CB    5                 5.192       76.923       18.081
30      N     6                 3.184       78.622       20.423
31      CA    6                 2.076       78.436       21.319
32      C     6                 1.134       77.348       20.765
33      O     6                 1.402       76.819       19.676
34      CB    6                 1.313       79.740       21.481
35      N     7                 0.109       77.048       21.553
36      CA    7                -0.883       76.089       21.152
37      C     7                -2.256       76.780       21.027
38      O     7                -2.407       77.911       21.512
39      CB    7                -0.965       74.968       22.174
40      N     8                -3.167       76.084       20.357
41      CA    8                -4.509       76.574       20.198
42      C     8                -5.503       75.588       20.843
43      O     8                -5.193       74.391       20.931
44      CB    8                -4.832       76.735       18.722
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Table 18 continued 



Atom    Atom  Position              x            y            z
number  type  in peptide

45      N     9                -6.623       76.144       21.290
46      CA    9                -7.669       75.348       21.873
47      C     9                -8.201       74.343       20.832
48      O     9                -8.407       74.731       19.672
49      CB    9                -8.801       76.243       22.347
50      N     10               -8.360       73.106       21.286
51      CA    10               -8.894       72.067       20.448
52      C     10              -10.383       72.344       20.162
53      O     10              -11.124       72.681       21.097
54      CB    10               -8.745       70.719       21.133
55      N     11              -10.734       72.224       18.886
56      CA    11              -12.097       72.403       18.469
57      C     11              -12.907       71.126       18.774
58      O     11          [illegible]  [illegible]       17.977
59      CB    11              -12.150       72.700       16.980
60      N     12              -13.575       71.155       19.921
61      CA    12              -14.414       70.059       20.322
62      C     12                0.000      -12.097       72.403
63      O     12               18.465        0.000      -12.097
64      CB    12                0.000        0.000        0.000






WO 98/59244 



PCT/GB98/01801 




Example A 

The following method was used to identify high affinity
binding peptides from Myelin Basic Protein (MBP). The binding
affinities for a set of MBP peptides to HLA-DRB1*0401 have
been experimentally determined and published. This set
includes all possible 13 amino acid peptides from the MBP
sequence which have a hydrophobic anchor residue at the P3
position; it is known that only such peptides bind to HLA-DR
molecules with detectable affinity.

The same homology model of HLA-DRB1*0401 was used for this 
example as was used in Examples 1 and 2. 

For each of the 13-mer peptides from the experimentally
determined set, a binding score was calculated as follows:

a) Calculate the steric overlap between the pocket bound
peptide residue in the binding groove and an atom forming the
pocket; this is value B.

b) Count the number of hydrogen bonds which could be formed
between the pocket bound peptide residue and atoms forming the
pocket; this is value C.

c) Calculate the strength of electrostatic interactions
between any polar atoms of the pocket bound peptide residue
and any polar atoms forming the pocket; this is value D.

d) Count the number of favourable contacts between the pocket
bound peptide residue and atoms forming the pocket; this is
value E.

e) These values were then transformed into a conformation
score (Z) by using the following equation:

$Z_n = cK_2 C - cK_3 D + cK_4 E - cK_1 B$

where $cK_1$ to $cK_4$ are constants and n is the sequence position of
the peptide residue (numbered from 1 at the N-terminus to 13
at the C-terminus). $cK_1$, $cK_2$, $cK_3$ and $cK_4$ are equal to 100, 1500,
500 and 1000, respectively.
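The conformation score equation above can be sketched in a few lines of Python. This is only an illustration: the function name, argument names and input values are assumptions made for the example, and only the formula and the four constants are taken from the text.

```python
# Constants cK1..cK4 as stated in the example (100, 1500, 500, 1000).
CK1, CK2, CK3, CK4 = 100.0, 1500.0, 500.0, 1000.0


def conformation_score(b, c, d, e):
    """Return Z_n = cK2*C - cK3*D + cK4*E - cK1*B for one pocket bound residue.

    b: steric overlap (value B)
    c: number of possible hydrogen bonds (value C)
    d: strength of electrostatic interactions (value D)
    e: number of favourable contacts (value E)
    """
    return CK2 * c - CK3 * d + CK4 * e - CK1 * b


# Hypothetical input values, chosen only to illustrate the arithmetic.
z = conformation_score(b=0.25, c=2, d=0.5, e=6)
print(z)  # 3000 - 250 + 6000 - 25 = 8725.0
```

With these weights a single favourable contact or hydrogen bond outweighs a small steric overlap, which is why the overlap term carries the smallest constant.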

The conformation of each rotatable side-chain of the peptide 
residue was then altered by 15 degrees and the conformation 
score was recalculated. 

The above steps were repeated for each residue of the peptide
and the highest conformation score for each peptide residue
was used to determine the conformation score for the peptide.
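The rotate-and-rescore loop can be sketched as follows. The sketch assumes a single rotatable bond scanned over a full turn in 15 degree steps; the text only states the 15 degree increment, so the full-turn range is an assumption, as are the function names. The scoring function is supplied by the caller and is a toy here.

```python
def best_side_chain_score(score_at_angle, step_degrees=15):
    """Scan one rotatable bond in fixed steps and keep the highest score.

    score_at_angle is a caller-supplied (hypothetical) function returning
    the conformation score Z for a given side-chain rotation in degrees.
    """
    angles = range(0, 360, step_degrees)
    best_angle = max(angles, key=score_at_angle)
    return best_angle, score_at_angle(best_angle)


def toy_score(angle):
    # Toy scoring function that peaks at 90 degrees, for illustration only.
    return -abs(angle - 90)


angle, score = best_side_chain_score(toy_score)
print(angle, score)  # 90 0
```

A residue with several rotatable bonds would need the same scan over each bond, which multiplies the number of conformations to score.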

At this point, the entire procedure for establishing the
conformation score for the peptide was repeated another 166
times, each time using a different peptide backbone from the
library of peptide backbones.

The combination of peptide backbone and peptide side-chain
conformations which gave the best conformation score was then used
to determine a binding score for the peptide.
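Selecting the best backbone from the library amounts to taking a maximum over backbones. The sketch below assumes a caller-supplied scoring function and uses made-up backbone labels and scores; none of these names come from the specification.

```python
def best_backbone(backbones, score_on_backbone):
    """Return (backbone, score) for the backbone with the highest score.

    score_on_backbone is a caller-supplied (hypothetical) function that
    places the peptide on one backbone, optimises the side chains and
    returns the resulting conformation score.
    """
    scored = [(score_on_backbone(b), b) for b in backbones]
    best_score, best = max(scored, key=lambda pair: pair[0])
    return best, best_score


# Toy library of three backbone labels with made-up scores.
toy_scores = {"backbone_a": 5200.0, "backbone_b": 8725.0, "backbone_c": 6100.0}
best, score = best_backbone(list(toy_scores), toy_scores.get)
print(best, score)  # backbone_b 8725.0
```

In the example described in the text the library would hold 167 backbones (the first run plus another 166), each scored after its own side-chain scan.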

The binding score was determined by establishing values of the
following parameters:

a) Calculate the steric overlap between the pocket bound
peptide residue in the binding groove and an atom forming the
pocket; this is value B.

b) Count the number of hydrogen bonds which could be formed
between the pocket bound peptide residue and atoms forming the
pocket; this is value C.

c) Calculate the strength of electrostatic interactions
between any polar atoms of the pocket bound peptide residue
and any polar atoms forming the pocket; this is value D.

d) Count the number of favourable contacts between the
pocket bound peptide residue and atoms forming the pocket;
this is value E.

e) Calculate the hydrophobicity of the pocket bound peptide
side chains using a hydrophobicity scale disclosed in Janin
et al.; this is value He.

f) Calculate the number of MHC pocket residues which are
paired with the pocket bound peptide residues. Pairing takes
place if the centre of an atom from the MHC pocket residue and
the centre of an atom from the pocket bound peptide residue
are separated by no more than the sum of their van der Waals
radii plus one Angstrom. The value $A_n$ is calculated by summing
the number of paired residues, where n is the number of the pocket.
The values of $A_n$, weighted according to each pocket's importance in
binding, are summed to give a value P.
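The pairing criterion in step f) is a simple distance test. The sketch below is an illustration under stated assumptions: atoms are represented as (centre, radius) tuples, the radii and pocket weights are made-up values, and the function names are hypothetical.

```python
import math


def atoms_paired(centre_a, radius_a, centre_b, radius_b, margin=1.0):
    """True if two atom centres lie within r_a + r_b + margin (Angstroms)."""
    return math.dist(centre_a, centre_b) <= radius_a + radius_b + margin


def residues_paired(atoms_a, atoms_b):
    """True if any atom of one residue pairs with any atom of the other.

    Each atom is a (centre, radius) tuple; centre is an (x, y, z) tuple.
    """
    return any(
        atoms_paired(ca, ra, cb, rb)
        for ca, ra in atoms_a
        for cb, rb in atoms_b
    )


def weighted_pairing_total(a_values, weights):
    """P = sum over pockets of weight_n * A_n."""
    return sum(w * a for a, w in zip(a_values, weights))


# Illustrative radius only (1.7 Angstroms is a commonly quoted value for carbon).
mhc_residue = [((0.0, 0.0, 0.0), 1.7)]
near_peptide_residue = [((4.0, 0.0, 0.0), 1.7)]
far_peptide_residue = [((5.0, 0.0, 0.0), 1.7)]
print(residues_paired(mhc_residue, near_peptide_residue))  # True: 4.0 <= 4.4
print(residues_paired(mhc_residue, far_peptide_residue))   # False: 5.0 > 4.4
print(weighted_pairing_total([3, 1, 2], [2.0, 1.0, 1.0]))  # 9.0
```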

The above values were then entered into the following
equation in order to determine the binding score (Y):

$Y = P + bK_2 C - bK_3 D + bK_4 E - bK_1 B + bK_5 \mathit{He}$

where the values of $bK_1$, $bK_2$, $bK_3$, $bK_4$ and $bK_5$ are 2, 40, 600,
10 and 200, respectively.
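As a minimal sketch, the binding score equation can be evaluated as below. The constants are those stated in the example; the function name, argument names and input values are assumptions made for illustration.

```python
# Constants bK1..bK5 as stated in the example (2, 40, 600, 10, 200).
BK1, BK2, BK3, BK4, BK5 = 2.0, 40.0, 600.0, 10.0, 200.0


def binding_score(p, b, c, d, e, he):
    """Return Y = P + bK2*C - bK3*D + bK4*E - bK1*B + bK5*He."""
    return p + BK2 * c - BK3 * d + BK4 * e - BK1 * b + BK5 * he


# Hypothetical input values, chosen only to illustrate the arithmetic.
y = binding_score(p=12, b=0.5, c=3, d=0.25, e=8, he=1.5)
print(y)  # 12 + 120 - 150 + 80 - 1 + 300 = 361.0
```

Compared with the conformation score, the weights here shift emphasis towards the electrostatic and hydrophobicity terms, so the two scores are not interchangeable.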

As can be seen from the results in Table 19 the top four 
predicted scores pertain to four peptides which appear within 
the top five best binders. 
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Table 19 
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CLAIMS 

1. A method for the prediction of the binding affinity of
a peptide to a major histocompatibility (MHC) class II
molecule, comprising:

a) ascertaining the characteristics of an MHC molecule binding
groove,

b) presenting a selected peptide to the MHC molecule and
ascertaining a first conformation score for each pocket bound
peptide side-chain,

c) amending the conformation of each pocket bound peptide
side-chain and ascertaining a second conformation score,

d) repeating step c) with alternative conformations of each
pocket bound peptide side-chain,

e) choosing the highest conformation score for each pocket
bound peptide side-chain in each binding groove pocket,
herein known as 'the pocket', and

f) combining the highest conformation score for each pocket
and ascertaining a binding score for the complete peptide.

2. A method according to claim 1 which further comprises the 
step of compiling information on all peptide fragments in a 
protein and comparing the binding scores. 

3. A method according to any preceding claim wherein the
conformation score is ascertained by at least one of the
following parameters:

a) the number of favourable contacts between MHC residues
forming one of the pockets and the pocket bound peptide
residue; this is value E,

b) the steric overlap between the pocket bound peptide residue
bound in the pocket and an atom forming the pocket; this is
value B,

c) the number of hydrogen bonds which could be formed between
the pocket bound peptide residue and an atom forming the
pocket; this is value C,

d) the strength of electrostatic interactions between any
polar atoms of the pocket bound peptide residue and any polar
atoms forming the pocket; this is value D.

4. A method according to claim 3 wherein the steric overlap
between the pocket bound peptide residue and the atoms forming
the pocket cannot be greater than 0.35 Angstroms.

5. A method according to claim 3 wherein a favourable 
contact occurs when an atom from an MHC residue and an atom 
from the peptide residue have their centres separated by no 
more than the sum of their radii plus 0.5 Angstroms and are 
not overlapping. 

6. A method according to the preceding claims wherein values
B to E are imported into a first equation, to give a
conformation score (Z).

7. A method according to claim 6 wherein the first equation
is $Z_n = (cK_2 C) - (cK_3 D) + (cK_4 E) - (cK_1 B)$, where $cK_1$ to $cK_4$ are
constants and n is the number of the pocket.

8. A method according to claim 7 wherein $cK_1$ is between 50
and 150.

9. A method according to claim 7 wherein $cK_2$ is between 1000
and 2000.

10. A method according to claim 7 wherein $cK_3$ is between 250
and 750.

11. A method according to claim 7 wherein $cK_4$ is between 500
and 1500.

12. A method according to any preceding claim wherein the $Z_n$ value
for a pocket is multiplied by a coefficient, L, depending on
the pocket's importance in binding, to give a second $Z_n$ value.

13. A method according to any of the preceding claims wherein
all the Z values are summed to give a value J.

14. A method according to any of the preceding claims wherein
the MHC residue is paired with the pocket-bound peptide
residue if an atom from the MHC residue and an atom from the
pocket-bound peptide residue have their centres separated by
no more than the sum of their van der Waals radii plus one
Angstrom.

15. A method according to claim 14 wherein a value $A_n$ is
calculated by summing the pairwise interaction frequencies of
paired residues.

16. A method according to either claim 14 or 15 wherein the
value $A_n$ for a pocket is multiplied by a coefficient, X,
depending on the pocket's importance in binding.

17. A method according to claim 16 wherein the $A_n$ values for
the pockets are summed to give a value P.

18. A method according to any preceding claim wherein the
binding score is ascertained by at least one of the following
parameters:

a) the number of groove-bound hydrophobic residues; this is
value F,

b) the number of non groove-bound hydrophilic residues; this
is value G,

c) the number of peptide residues deemed to fit within their
respective binding pocket; this is value H.

19. A method according to any one of claims 13 to 18 wherein
values F, G, H, J and P are imported into a second equation
to give a first binding score, Y.

20. A method according to claim 19 wherein the second
equation is $Y = J \cdot F^2 \cdot (G \cdot H + 1) + P$.

21. A method according to any one of claims 1 to 17 wherein the
hydrophobicity of the pocket bound peptide side chains is
evaluated using a hydrophobicity scale; this is value He.

22. A method according to claim 21 wherein the hydrophobicity
scale ranges from -1.8 for lysine to 0.9 for cysteine.

23. A method according to either of claims 21 or 22 wherein
$Y = (bK_2 C) - (bK_3 D) + (bK_4 E) - (bK_1 B) + (bK_5 \mathit{He}) + P$.

24. A method according to claim 23 wherein $bK_1$ is between 1
and 5.

25. A method according to claim 23 wherein $bK_2$ is between 20
and 60.

26. A method according to claim 23 wherein $bK_3$ is between 300
and 900.

27. A method according to claim 23 wherein $bK_4$ is between 1
and 20.

28. A method according to claim 23 wherein $bK_5$ is between 1
and 800.

29. A method according to any preceding claim wherein the
steps in claim 3 are repeated for each pocket and each
conformation of the peptide residue in said pocket.

30. A method according to claim 29 wherein the conformation
of the peptide is altered by rotating a side chain of the
peptide residue by a pre-determined amount.

31. A method according to either claim 29 or 30 wherein the
conformation of the peptide is altered by changing the
conformation of the peptide backbone.
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32. A method according to any preceding claim wherein the 
steps are repeated using different peptides from a protein. 

33. A method according to any of the preceding claim wherein 
5 the binding scores (Y) for different peptides are tabulated 

and compared. 

34. A method according to any of the preceding claim which 
is used in the manufacture of a vaccine derived from a peptide 

10 identified by said method. 

35. A method according to any of the preceding claims which 
is used to remove potentially immunogenic sequences from a 
protein and thus reduce said proteins immunogenic ity when 

15 administered to an organism. 

36. A computer conditioned to receive information 
characterising a peptide bound to the MHC molecule and to 
utilise said information to perform a procedure having the 

20 following steps; 

a) ascertaining the characteristics of a MHC molecule binding 
groove ; v 

b) presenting a selected peptide, which is selected by a 
predetermined program, to the MHC molecule and ascertaining 

25 a first conformation score; 

c) amending the conformation of the peptide, by way of a 
predetermined program, and ascertaining a second conformation 
score ; 

d) repeating step 3 with other conformations of the peptide; 
30 e) selecting the peptide conformation with the highest 

conformation score; and 

f) calculating the binding score from the conf ormation score. 

37. A computer according to claim 36 further comprising a 
35 step (7) which comprises repeating steps 1-4 with other 

peptide fragments in the protein 

to generate information on all peptide fragments in a protein 
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so that a comparison can be made of the strength of the 
binding between the peptide and the MHC molecule. 

38. A computer according to either claim 36 or 37 further
comprising a step (8) which comprises altering the
conformation of the backbone of the peptide fragment.

39. A pharmaceutical composition resulting from
a method as claimed in any one of claims 1 to 35.

MUTANT PROTEINS HAVING LOWER ALLERGENIC
RESPONSE IN HUMANS AND METHODS FOR 
CONSTRUCTING, IDENTIFYING AND PRODUCING SUCH PROTEINS 

BACKGROUND OF THE INVENTION

A. Field of the Invention 

The present invention relates to proteins which produce lower allergenic response
in humans exposed to such proteins, and an assay predictive of such response. More
specifically, the present invention relates to a novel improved protein mutant which
produces very low allergenic response in humans sensitized to that protein through
exposure compared to the precursor of such protein mutant.

B. State of the Art 

Proteins used in industrial, pharmaceutical and commercial applications are of
increasing prevalence. As a result, the increased exposure due to this prevalence has
been responsible for some safety hazards caused by the sensitization of certain persons
to those peptides, whereupon subsequent exposure causes extreme allergic reactions
which can be injurious and even fatal. For example, proteases are known to cause
dangerous hypersensitivity in some individuals. As a result, despite the usefulness of
proteases in industry, e.g., in laundry detergents, cosmetics, textile treatment etc., and
the extensive research performed in the field to provide improved proteases which have,
for example, more effective stain removal under detergency conditions, the use of
proteases in industry has been problematic due to their ability to produce a hypersensitive
allergenic response in some humans.

Much work has been done to alleviate these problems. Among the strategies
explored to reduce immunogenic potential of protease use have been improved production
processes which reduce potential contact by controlling and minimizing workplace
concentrations of dust particles or aerosol carrying airborne protease, improved
granulation processes which reduce the amount of dust or aerosol actually produced from
the protease product, and improved recovery processes to reduce the level of potentially
allergenic contaminants in the final product. However, efforts to reduce the allergenicity of
protease, per se, have been relatively unsuccessful. Alternatively, efforts have been
made to mask epitopes in protease which are recognized by immunoglobulin E (IgE) in
hypersensitive individuals (PCT Publication No. WO 92/10755) or to enlarge or change the
nature of the antigenic determinants by attaching polymers or peptides/proteins to the
problematic protease.

When an adaptive immune response occurs in an exaggerated or inappropriate
form, the individual experiencing the reaction is said to be hypersensitive. Hypersensitivity
reactions are the result of normally beneficial immune responses acting inappropriately
and sometimes cause inflammatory reactions and tissue damage. They can be provoked
by many antigens; and the cause of a hypersensitivity reaction will vary from one
individual to the next. Hypersensitivity does not normally manifest itself upon first contact
with the antigen, but usually appears upon subsequent contact. One form of
hypersensitivity occurs when an IgE response is directed against innocuous environmental
antigens, such as pollen, dust-mites or animal dander. The resulting release of
pharmacological mediators by IgE-sensitized mast cells produces an acute inflammatory
reaction with symptoms such as asthma or rhinitis.

Nonetheless, a strategy comprising modifying the IgE sites will not generally be
successful in preventing the cause of the initial sensitization reaction. Accordingly, such
strategies, while perhaps neutralizing or reducing the severity of the subsequent
hypersensitivity reaction, will not reduce the number of persons actually sensitized. For
example, when a person is known to be hypersensitive to a certain antigen, the general,
and only safe, manner of dealing with such a situation is to isolate the hypersensitive
person from the antigen as completely as possible. Indeed, any other course of action
would be dangerous to the health of the hypersensitive individual. Thus, while reducing
the danger of a specific protein for a hypersensitive individual is important, for industrial
purposes it would be far more valuable to render a protein incapable of initiating the
hypersensitivity reaction in the first place.

T-lymphocytes (T-cells) are key players in the induction and regulation of immune
responses and in the execution of immunological effector functions. Specific immunity
against infectious agents and tumors is known to be dependent on these cells and they
are believed to contribute to the healing of injuries. On the other hand, failure to control
these responses can lead to autoaggression. In general, antigen is presented to T-cells
by antigen presenting cells which, through a variety of cell surface
mechanisms, capture and display antigen or partial antigen in a manner suitable for
antigen recognition by the T-cell. Upon recognition of a specific epitope by the receptors
on the surface of the T-cells (T-cell receptors), the T-cells begin a series of complex
interactions, including proliferation, which result in the production of antibody by B-cells.
While T-cells and B-cells are both activated by antigenic epitopes which exist on a given
protein or peptide, the actual epitopes recognized by these mononuclear cells are
generally not identical. In fact, the epitope which activates a T-cell to initiate the creation
of immunologic diversity is quite often not the same epitope which is later recognized by
B-cells in the course of the immunologic response. Thus, with respect to hypersensitivity,
while the specific antigenic interaction between the T-cell and the antigen is a critical
element in the initiation of the immune response to antigenic exposure, the specifics of
that interaction, i.e., the epitope recognized, are often not relevant to subsequent
development of a full-blown allergic reaction.

PCT Publication No. WO 96/40791 discloses a process for producing polyalkylene 
oxide-polypeptide conjugates with reduced allergenicity using polyalkylene oxide as a 
starting material.

reaction has been initiated, sensitization has already occurred. Accordingly, there is a
need for a method of determining epitopes which cause sensitization in the first place, as 
neutralization of these epitopes will result in significantly less possibility for sensitization to 
occur, thus reducing the possibility of initial sensitization. 

SUMMARY OF THE INVENTION 

It is an object of the invention to provide a protein having decreased potential to 
cause allergenic response in humans compared to a precursor protein. 

It is a further object of the present invention to provide for a protease variant which
has useful activity in common protease applications, such as detergents and/or the
treatment of wool to prevent felting, in bar or liquid soap applications, dish-care
formulations, contact lens cleaning solutions or products, peptide hydrolysis, waste
treatment, textile applications such as anti-felting, in cosmetic formulations and for skin
care, or as fusion-cleavage enzymes in protein production, which protease variant can be
more safely used due to its lowered allergenic potential.

According to the present invention, a method for identifying T-cell epitopes within a
protein is provided. The present invention provides an assay which identifies epitopes as
follows: antigen presenting cells are combined with naive human T-cells and with a
peptide of interest. In a preferred embodiment of the invention, a method is provided
wherein a T-cell epitope is recognized comprising the steps of: (a) obtaining from a single
blood source a solution of dendritic cells and a solution of naive CD4+ and/or CD8+
T-cells; (b) promoting differentiation in said solution of dendritic cells; (c) combining said
solution of differentiated dendritic cells and said naive CD4+ and/or CD8+ T-cells with a
peptide of interest; (d) measuring the proliferation of T-cells in said step (c).

According to another embodiment of the present invention, a protein is provided in
which a T-cell epitope is modified so as to reduce or preferably neutralize (eliminate) the
ability of the T-cell to identify that epitope. Thus, a protein is provided having reduced
allergenicity, wherein said protein comprises a modification comprising the substitution or
deletion of amino acid residues which are identified as within a T-cell epitope. According
to a preferred embodiment, an epitope is determined in a protein or peptide which, when
recognized by a T-cell, results in the proliferation of T-cells which is greater than the
baseline. That T-cell epitope is then modified so that, when the peptide comprising the
epitope is analyzed in the assay of the invention, it results in lesser proliferation than the
protein comprising the unmodified epitope. More preferably, the epitope to be modified
produces greater than three times the baseline T-cell proliferation in a sample. When
modified, the epitope produces less than three times the baseline T-cell proliferation,
preferably less than two times the baseline T-cell proliferation and most preferably less
than or substantially equal to the baseline T-cell proliferation in a sample.

Preferably, the epitope is modified in one of the following ways: (a) the amino acid
sequence of the epitope is substituted with an analogous sequence from a human
homolog to the protein of interest, i.e., human subtilisin or another human protease
derived subtilisin like molecule such as furin or the kexins (see e.g., Methods in
Enzymology, Vol. 244, (1994) pp. 175 et seq.; Roebroek et al., EMBO J., Vol. 5, No. 9, pp.
2197-2202 (1986); Tomkinson et al., Biochem., Vol. 30, pp. 168-174 (1991); Keifer et al.,
DNA and Cell Biol., Vol. 10, No. 10, pp. 757-769 (1991)); (b) the amino acid sequence of
the epitope is substituted with an analogous sequence from a non-human homolog to the
protein of interest, which analogous sequence produces a lesser allergenic response due
to T-cell recognition than that of the protein of interest; (c) the amino acid sequence of the
epitope is substituted with a sequence which substantially mimics the major tertiary
structure attributes of the epitope, but which produces a lesser allergenic response due to
T-cell recognition than that of the protein of interest; or (d) with any sequence which
produces lesser allergenic response due to T-cell recognition than that of the protein of
interest.

In a specific embodiment of the invention, a protease variant is provided
comprising at least one amino acid substitution at a position corresponding to residues
170, 171, 172 and/or 173 in BPN', wherein such substitutions comprise modifying residue
170 to aspartic acid, modifying residue 171 to glutamine, modifying residue 172 to
methionine and/or modifying residue 173 to aspartic acid. In a most preferred
embodiment, the substitution comprises modifying residues 170, 171 and 173 to aspartic
acid, glutamine and aspartic acid, respectively.

In another embodiment of the present invention, a method for producing the
protein of the invention having reduced allergenicity is provided. Preferably, the mutant
protein is prepared by modifying a DNA encoding a precursor protein so that the modified
DNA encodes the mutant protein of the invention.

In yet another embodiment of the invention, DNA sequences encoding the mutant
protein, as well as expression vectors containing such DNA sequences and host cells
transformed with such vectors are provided, which host cells are preferably capable of
expressing such DNA to produce the mutant protein of the invention either intracellularly
or extracellularly.

The mutant protein of the invention is useful in any composition or process in
which the precursor protein is generally known to be useful. For example, where the
protein is a protease, the reduced allergenicity protease can be used as a component in
cleaning products such as laundry detergents and hard surface cleansers, as an aid in the
preparation of leather, in the treatment of textiles such as wool and/or silk to reduce
felting, as a component in a personal care, cosmetic or face cream product, and as a
component in animal or pet feed to improve the nutritional value of the feed. Similarly,
where the protein is an amylase, the reduced allergenicity amylase can be used for the
liquefaction of starch, as a component in a dishwashing detergent, for desizing of textiles,
in a laundry detergent or any other use for which amylase is useful.

One advantage of the present invention is that by measuring the proliferation of
T-cells due to T-cell epitope recognition, it is possible to identify peptides which contain
epitopes responsible for initially sensitizing an individual. That is, T-cell proliferation due to
T-cell epitope recognition results in sensitization of an individual to that peptide or a
protein which contains it. Neutralization of such "sensitizing" T-cell epitopes will inevitably
result in a greater degree of safety for those who handle or are otherwise exposed to the
antigen containing the epitope because they will not be initially sensitized, thus preventing
the production of Ig antibodies typical of an allergic reaction upon subsequent exposure to
the antigen.

An advantage of the present invention is the preparation of proteins, including
enzymes, which may be used with significantly less danger of sensitization for the
individuals exposed. Thus, for example, the proteins of the invention may be more safely
used in cosmetics such as face creams, detergents such as laundry detergents, hard
surface cleaning compositions and pre-wash compositions or any other use of protein,
including enzymes, wherein human exposure is a necessary by-product.



BRIEF DESCRIPTION OF THE DRAWINGS 

Figs. 1A, B1, B2 and B3 illustrate the DNA (SEQ ID:NO 1) and amino acid (SEQ
ID:NO 2) sequence for Bacillus amyloliquefaciens subtilisin (BPN') and a partial restriction
map of this gene.

Fig. 2 illustrates the conserved amino acid residues among subtilisins from Bacillus 
amyloliquefaciens (SEQ ID: NO 3) and Bacillus lentus (wild-type) (SEQ ID: NO 4). 

Figs. 3A and 3B illustrate an amino acid sequence alignment of subtilisin type
proteases from Bacillus amyloliquefaciens (BPN'), Bacillus subtilis, Bacillus licheniformis
(SEQ ID:NO 5) and Bacillus lentus. The symbol * denotes the absence of specific amino
acid residues as compared to subtilisin BPN'.

WO 99/53038    PCT/US99/08253

Fig. 4 illustrates the additive T-cell response of 16 peripheral mononuclear blood
samples to peptides corresponding to the Bacillus lentus protease. Peptide E05 includes
the region comprising residues corresponding to 170-173 in protease from Bacillus
amyloliquefaciens.

Fig. 5 illustrates the additive T-cell response of 10 peripheral mononuclear blood
samples to peptides corresponding to the human subtilisin molecule. Peptides F10, F9,
F8 and F7 all contain the amino acid sequence DQMD corresponding to the region
comprising residues corresponding to 170-173 in protease from Bacillus amyloliquefaciens
in the sequence alignment of Fig. 3.

Figs. 6A and 6B/6C illustrate amino acid strings corresponding to peptides derived
from the sequence of Bacillus lentus protease and a human subtilisin, respectively.

Fig. 7 illustrates the amino acid sequence of human subtilisin (SEQ ID:NO 6).

Fig. 8 illustrates an amino acid sequence alignment of BPN' (Bacillus
amyloliquefaciens) protease, SAVINASE (Bacillus lentus) protease and human subtilisin
(S2HSBT).

Fig. 9 illustrates the T-cell response to peptides derived from Bacillus lentus
protease in a sample taken from an individual known to be hypersensitive to Bacillus
lentus protease. Peptide E05 represents the region corresponding to 170-173 in protease
from Bacillus amyloliquefaciens.

Fig. 10 illustrates the T-cell response to various alanine substitutions in the E05
Bacillus lentus protease peptide set in a sample taken from an individual known to be
hypersensitive to Bacillus lentus protease.

DETAILED DESCRIPTION OF THE INVENTION 

According to the present invention, a method for identifying T-cell epitopes is
provided. The present invention provides an assay which identifies epitopes as follows:
differentiated dendritic cells are combined with naive human CD4+ and/or CD8+ T-cells
and with a peptide of interest. More specifically, a method is provided wherein a T-cell
epitope is recognized comprising the steps of: (a) obtaining from a single blood source a
solution of dendritic cells and a solution of naive CD4+ and/or CD8+ T-cells; (b) promoting
differentiation in said solution of dendritic cells; (c) combining said solution of differentiated
dendritic cells and said naive CD4+ and/or CD8+ T-cells with a peptide of interest; (d)
measuring the proliferation of T-cells in said step (c).

The peptide of interest to be analyzed according to the assay of the invention is
derived from a protein or enzyme for which reduced allergenicity is desirable or required.
In the practice of the invention, it is possible to identify with precision the location of an
epitope which can cause sensitization in an individual or sampling of individuals. In a
particularly effective embodiment of the invention, a series of peptide oligomers which
correspond to all or part of the protein or enzyme are prepared. For example, a peptide
library is produced covering the relevant portion or all of the protein. One particularly
useful manner of producing the peptides is to introduce overlap into the peptide library, for
example, producing a first peptide corresponding to amino acid sequence 1-10 of the
subject protein, a second peptide corresponding to amino acid sequence 4-14 of the subject
protein, a third peptide corresponding to amino acid sequence 7-17 of the subject protein, a
fourth peptide corresponding to amino acid sequence 10-20 of the subject protein, etc., until
representative peptides corresponding to the entire molecule are created. By analyzing
each of the peptides individually in the assay provided herein, it is possible to precisely
identify the location of epitopes recognized by T-cells. In the example above, the reaction
of one specific peptide to a greater extent than its neighbors will facilitate identification of
the epitope anchor region to within three amino acids. After determining the location of
these epitopes, it is possible to alter the amino acids within each epitope until the peptide
produces a less significant T-cell response.
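
By way of illustration only, the overlapping peptide library described above may be sketched in Python. The function name `overlapping_peptides`, the default peptide length of 10 residues and the offset of 3 residues are assumptions chosen to resemble the example ranges; the ranges quoted in the text vary slightly in length, and nothing in the assay depends on this code.

```python
def overlapping_peptides(sequence, length=10, offset=3):
    """Split a protein sequence into overlapping peptides.

    Returns a list of (start, end, peptide) tuples using 1-based,
    inclusive residue positions. `length` and `offset` are
    illustrative assumptions, not values fixed by the specification.
    """
    if length <= 0 or offset <= 0:
        raise ValueError("length and offset must be positive")
    if len(sequence) <= length:
        # Sequence too short to tile: return it whole (or nothing if empty).
        return [(1, len(sequence), sequence)] if sequence else []

    peptides = [
        (start + 1, start + length, sequence[start:start + length])
        for start in range(0, len(sequence) - length + 1, offset)
    ]

    # Make sure the C-terminal residues are covered by a final peptide.
    last_start = len(sequence) - length
    if peptides[-1][0] != last_start + 1:
        peptides.append((last_start + 1, len(sequence), sequence[last_start:]))
    return peptides


if __name__ == "__main__":
    # Hypothetical 20-residue sequence used only to show the tiling.
    for start, end, peptide in overlapping_peptides("ACDEFGHIKLMNPQRSTVWY"):
        print(f"{start}-{end}: {peptide}")
```

With a 20-residue sequence and these defaults the sketch yields peptides 1-10, 4-13, 7-16 and 10-19, plus a final peptide 11-20 so that the C-terminus is represented. An offset of three residues is what allows a response peak to be localized to within about three amino acids.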

"Antigen presenting cell" as used herein means a cell of the immune system which 
presents antigen on its surface which is recognizable by receptors on the surface of
T-cells. Examples of antigen presenting cells are dendritic cells, interdigitating cells,
activated B-cells and macrophages.

"T-cell proliferation" as used herein means the number of T-cells produced during 
the incubation of T-cells with the antigen presenting cells, with or without antigen. 

"Baseline T-cell proliferation" as used herein means T-cell proliferation which is 
normally seen in an individual in response to exposure to antigen presenting cells in the
absence of peptide or protein antigen. For the purposes herein, the baseline T-cell
proliferation level was determined on a per sample basis for each individual as the
proliferation of T-cells in response to antigen presenting cells in the absence of antigen.

"T-cell epitope" means a feature of a peptide or protein which is recognized by a T- 
cell receptor in the initiation of an immunologic response to the peptide comprising that
antigen. Recognition of a T-cell epitope by a T-cell is generally believed to be via a
mechanism wherein T-cells recognize peptide fragments of antigens which are bound to
class I or class II major histocompatibility (MHC) molecules expressed on
antigen-presenting cells (see e.g., Moeller, G. ed., Antigenic Requirements for Activation of
MHC-Restricted Responses, Immunological Review, Vol. 98, p. 187 (Copenhagen;
Munksgaard) (1987)).

The epitopes determined according to the assay provided herein are then modified
to reduce the allergenic potential of the protein of interest. In a preferred embodiment, the
epitope to be modified produces a level of T-cell proliferation of greater than three times
the baseline T-cell proliferation in a sample. When modified, the epitope produces less
than three times the baseline proliferation, preferably less than two times the baseline
proliferation and most preferably less than or substantially equal to the baseline
proliferation in a sample.
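
The thresholds recited above can be expressed as a ratio of peptide-induced proliferation to baseline proliferation. The following Python sketch is illustrative only: the term "stimulation index", the function names and the proliferation values are assumptions introduced here, and treating "substantially equal to the baseline" as a ratio of at most 1.0 is a simplification.

```python
def stimulation_index(peptide_proliferation, baseline_proliferation):
    """Ratio of proliferation with peptide to baseline proliferation.

    Both values must be in the same (arbitrary) units, measured on the
    same sample. The name 'stimulation index' is an assumption.
    """
    if baseline_proliferation <= 0:
        raise ValueError("baseline proliferation must be positive")
    return peptide_proliferation / baseline_proliferation


def is_candidate_epitope(peptide_proliferation, baseline_proliferation,
                         threshold=3.0):
    """True if the peptide gives more than `threshold` times the baseline."""
    return stimulation_index(peptide_proliferation,
                             baseline_proliferation) > threshold


def classify_modified_epitope(peptide_proliferation, baseline_proliferation):
    """Grade a modified epitope against the recited preference levels."""
    ratio = stimulation_index(peptide_proliferation, baseline_proliferation)
    if ratio <= 1.0:   # simplification of "substantially equal to baseline"
        return "most preferred"
    if ratio < 2.0:
        return "preferred"
    if ratio < 3.0:
        return "acceptable"
    return "not reduced"


if __name__ == "__main__":
    baseline = 1000.0  # hypothetical baseline reading for one sample
    print(is_candidate_epitope(3500.0, baseline))
    print(classify_modified_epitope(1500.0, baseline))
```

Because the baseline is determined per sample, the ratio should always be computed against the baseline of the same individual's sample rather than a pooled value.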

Preferably, the epitope is modified in one of the following ways: (a) the amino acid
sequence of the epitope is substituted with an analogous sequence from a human
homolog to the protein of interest; (b) the amino acid sequence of the epitope is
substituted with an analogous sequence from a non-human homolog to the protein of
interest, which analogous sequence produces a lesser allergenic response due to T-cell
epitope recognition than that of the protein of interest; (c) the amino acid sequence of the
epitope is substituted with a sequence which substantially mimics the major tertiary
structure attributes of the epitope, but which produces a lesser allergenic response due to
T-cell epitope recognition than that of the protein of interest; or (d) with any sequence
which produces lesser allergenic response due to T-cell epitope recognition than that of
the protein of interest.

"Sample" as used herein comprises mononuclear cells which are naive, i.e., not 

sensitized, to the antigen in question.

"Homolog" as used herein means a protein or enzyme which has similar catalytic 
action, structure and/or use as the protein of interest. It is desirable to find a homolog that
has a tertiary and/or primary structure similar to the protein of interest as replacement of
the epitope in the protein of interest with an analogous segment from the homolog will
reduce the disruptiveness of the change. Thus, closely homologous enzymes will provide
the most desirable source of epitope substitutions. Alternatively, if possible, it is
advantageous to look to human analogs for a given protein. For example, substituting a
specific epitope in a bacterial subtilisin with a sequence from a human analog to subtilisin
(i.e., human subtilisin) should result in less allergenicity in the bacterial protein.

An "analogous" sequence may be determined by ensuring that the replacement
amino acids show a similar function, the tertiary structure and/or conserved residues to
the amino acids in the protein of interest at or near the epitope. Thus, where the epitope
region contains, for example, an alpha-helix or a beta-sheet structure, the replacement
amino acids should maintain that specific structure.

While the present invention extends to all proteins for which it is desired to reduce
allergenicity, for the sake of simplicity, the following will describe a particularly preferred
embodiment of the invention, the modification of protease. Proteases are carbonyl
hydrolases which generally act to cleave peptide bonds of proteins or peptides. As used
herein, "protease" means a naturally-occurring protease or a recombinant protease.
Naturally-occurring proteases include α-aminoacylpeptide hydrolase, peptidylamino acid
hydrolase, acylamino hydrolase, serine carboxypeptidase, metallocarboxypeptidase, thiol
proteinase, carboxylproteinase and metalloproteinase. Serine, metallo, thiol and acid
proteases are included, as well as endo and exo-proteases.

Subtilisins are bacterial or fungal proteases which generally act to cleave peptide
bonds of proteins or peptides. As used herein, "subtilisin" means a naturally-occurring
subtilisin or a recombinant subtilisin. A series of naturally-occurring subtilisins is known to
be produced and often secreted by various microbial species. Amino acid sequences of
the members of this series are not entirely homologous. However, the subtilisins in this
series exhibit the same or similar type of proteolytic activity. This class of serine
proteases shares a common amino acid sequence defining a catalytic triad which
distinguishes them from the chymotrypsin related class of serine proteases. The
subtilisins and chymotrypsin related serine proteases both have a catalytic triad
comprising aspartate, histidine and serine. In the subtilisin related proteases the relative
order of these amino acids, reading from the amino to carboxy terminus, is
aspartate-histidine-serine. In the chymotrypsin related proteases, the relative order, however, is
histidine-aspartate-serine. Thus, subtilisin herein refers to a serine protease having the
catalytic triad of subtilisin related proteases. Examples include but are not limited to the
subtilisins identified in Fig. 3 herein. Generally and for purposes of the present invention,
numbering of the amino acids in proteases corresponds to the numbers assigned to the
mature Bacillus amyloliquefaciens subtilisin sequence presented in Fig. 1.
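
The distinction drawn above depends only on the order in which the three catalytic residues appear along the sequence. As a minimal sketch, assuming the positions of the triad residues are already known, the order can be checked in Python as follows. The function names are hypothetical; the BPN' positions Asp32/His64/Ser221 are those given in this specification, and the chymotrypsin positions His57/Asp102/Ser195 follow the conventional chymotrypsinogen numbering.

```python
def triad_order(asp_position, his_position, ser_position):
    """Return the triad residues ordered from amino to carboxy terminus."""
    residues = sorted([
        (asp_position, "Asp"),
        (his_position, "His"),
        (ser_position, "Ser"),
    ])
    return "-".join(name for _, name in residues)


def classify_serine_protease(asp_position, his_position, ser_position):
    """Classify a serine protease by the sequence order of its triad."""
    order = triad_order(asp_position, his_position, ser_position)
    if order == "Asp-His-Ser":
        return "subtilisin-related"
    if order == "His-Asp-Ser":
        return "chymotrypsin-related"
    return "unclassified"


if __name__ == "__main__":
    # Subtilisin BPN': Asp32, His64, Ser221.
    print(classify_serine_protease(32, 64, 221))
    # Chymotrypsin (chymotrypsinogen numbering): Asp102, His57, Ser195.
    print(classify_serine_protease(102, 57, 195))
```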

"Recombinant subtilisin" or "recombinant protease" refer to a subtilisin or protease 
in which the DNA sequence encoding the subtilisin or protease is modified to produce a
variant (or mutant) DNA sequence which encodes the substitution, deletion or insertion of
one or more amino acids in the naturally-occurring amino acid sequence. Suitable
methods to produce such modification, and which may be combined with those disclosed
herein, include those disclosed in US Patent 4,760,025 (RE 34,606), US Patent 5,204,015
and US Patent 5,185,258.

"Non-human subtilisins" and the DNA encoding them may be obtained from many 
procaryotic and eucaryotic organisms. Suitable examples of procaryotic organisms
include gram negative organisms such as E. coli or Pseudomonas and gram positive
bacteria such as Micrococcus or Bacillus. Examples of eucaryotic organisms from which
subtilisin and their genes may be obtained include yeast such as Saccharomyces
cerevisiae, fungi such as Aspergillus sp.

"Human subtilisin" means proteins of human origin which have subtilisin type
catalytic activity, e.g., the kexin family of human derived proteases. An example of such a
protein is represented by the sequence in Fig. 7. Additionally, derivatives or homologs of
human subtilisin, including those from non-human sources such as mouse or rabbit, which
retain the essential ability to hydrolyze peptide bonds and have at least 50%, preferably at
least 65% and most preferably at least 80% homology to the protein of Fig. 7 are
considered human subtilisins for the purpose of the invention.

A "protease variant" has an amino acid sequence which is derived from the amino 
acid sequence of a "precursor protease". The precursor proteases include naturally- 
occurring proteases and recombinant proteases. The amino acid sequence of the
protease variant is "derived" from the precursor protease amino acid sequence by the
substitution, deletion or insertion of one or more amino acids of the precursor amino acid
sequence. Such modification is of the "precursor DNA sequence" which encodes the
amino acid sequence of the precursor protease rather than manipulation of the precursor
protease enzyme per se. Suitable methods for such manipulation of the precursor DNA
sequence include methods disclosed herein, as well as methods known to those skilled in
the art (see, for example, EP 0 328 299, WO 89/06279 and the US patents and applications
already referenced herein).

The amino acid position numbers used herein refer to those assigned to the
mature Bacillus amyloliquefaciens subtilisin sequence presented in Fig. 1. The invention,
however, is not limited to the mutation of this particular subtilisin but extends to precursor
proteases containing amino acid residues at positions which are "equivalent" to the
particular identified residues in Bacillus amyloliquefaciens subtilisin. In a preferred
embodiment of the present invention, the precursor protease is Bacillus lentus subtilisin
and the substitutions, deletions or insertions are made at the equivalent amino acid
residue in B. lentus corresponding to those listed above.

A residue (amino acid) of a precursor protease is equivalent to a residue of
Bacillus amyloliquefaciens subtilisin if it is either homologous (i.e., corresponding in
position in either primary or tertiary structure) or analogous to a specific residue or portion
of that residue in Bacillus amyloliquefaciens subtilisin (i.e., having the same or similar
functional capacity to combine, react, or interact chemically).

In order to establish homology to primary structure, the amino acid sequence of a
precursor protease is directly compared to the Bacillus amyloliquefaciens subtilisin primary
sequence and particularly to a set of residues known to be invariant in subtilisins for which
the sequence is known. For example, Fig. 2 herein shows the conserved residues as
between B. amyloliquefaciens subtilisin and B. lentus subtilisin. After aligning the
conserved residues, allowing for necessary insertions and deletions in order to maintain
alignment (i.e., avoiding the elimination of conserved residues through arbitrary deletion
and insertion), the residues equivalent to particular amino acids in the primary sequence
of Bacillus amyloliquefaciens subtilisin are defined. Alignment of conserved residues
preferably should conserve 100% of such residues. However, alignment of greater than
75% or as little as 50% of conserved residues is also adequate to define equivalent
residues. Conservation of the catalytic triad, Asp32/His64/Ser221, should be maintained.

For example, the amino acid sequence of subtilisin from Bacillus
amyloliquefaciens, Bacillus subtilis, Bacillus licheniformis (carlsbergensis) and Bacillus
lentus can be aligned to provide the maximum amount of homology between amino acid
sequences. A comparison of these sequences shows that there are a number of
conserved residues contained in each sequence. The conserved residues as between
BPN' and B. lentus are identified in Fig. 2.

These conserved residues, thus, may be used to define the corresponding
equivalent amino acid residues of Bacillus amyloliquefaciens subtilisin in other subtilisins
such as subtilisin from Bacillus lentus (PCT Publication No. WO 89/06279 published July
13, 1989), the preferred protease precursor enzyme herein, or the subtilisin referred to as
PB92 (EP 0 328 299), which is highly homologous to the preferred Bacillus lentus
subtilisin. The amino acid sequences of certain of these subtilisins are aligned in Figs. 3A
and 3B with the sequence of Bacillus amyloliquefaciens subtilisin to produce the maximum
homology of conserved residues. As can be seen, there are a number of deletions in the
sequence of Bacillus lentus as compared to Bacillus amyloliquefaciens subtilisin. Thus,
for example, the equivalent amino acid for Val165 in Bacillus amyloliquefaciens subtilisin
in the other subtilisins is isoleucine for B. lentus and B. licheniformis.
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Thus, for example, the amino acid at position +170 is lysine (K) in both fi. 
amyloliquefaciens and B. licheniformis subtilisins and arginine (R) in Savinase. In one 
embodiment of the protease variants of the invention, however, the amino acid equivalent 
to +170 in Bacillus amyloliquefaciens subtilisin is substituted with aspartic acid (D). The 

5 abbreviations and one letter codes for all amino acids in the present invention conform to 
the Patentln User Manual (GenBank, Mountain View, CA) 1990, p. 101. 

"Equivalent residues" may also be defined by determining homology at the level of 
tertiary structure for a precursor protease whose tertiary structure has been determined by 
x-ray crystallography. Equivalent residues are defined as those for which the atomic 

10 coordinates of two or more of the main chain atoms of a particular amino acid residue of 
the precursor protease and Bacillus amyloliquefaciens subtilisin (N on N, CA on CA, C on 
C and O on O) are within 0.1 3nm and preferably 0.1 nm after alignment. Alignment is 
achieved after the best model has been oriented and positioned to give the maximum 
overlap of atomic coordinates of non-hydrogen protein atoms of the protease in question 

15 to the Bacillus amyloliquefaciens subtilisin. The best model is the crystallographic model 
giving the lowest R factor for experimental diffraction data at the highest resolution 
available. 
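
The main chain distance criterion can be expressed as a small check. The following Python sketch is illustrative only: it assumes the two structures have already been superimposed, that coordinates are given in nanometres, and that each residue is supplied as a mapping from atom name to an (x, y, z) tuple. The function name and data layout are hypothetical.

```python
import math

MAIN_CHAIN_ATOMS = ("N", "CA", "C", "O")


def is_equivalent_residue(precursor_atoms, reference_atoms,
                          cutoff_nm=0.13, min_atoms=2):
    """Return True if at least `min_atoms` matching main chain atoms
    (N on N, CA on CA, C on C, O on O) lie within `cutoff_nm` of each other."""
    close = 0
    for name in MAIN_CHAIN_ATOMS:
        if name in precursor_atoms and name in reference_atoms:
            distance = math.dist(precursor_atoms[name], reference_atoms[name])
            if distance <= cutoff_nm:
                close += 1
    return close >= min_atoms
```

Passing `cutoff_nm=0.1` applies the preferred, stricter threshold.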



\[
R\ \text{factor} = \frac{\sum_h |F_o(h)| - |F_c(h)|}{\sum_h |F_o(h)|}
\]
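
As a numerical illustration of the R factor, the sketch below computes it from paired observed and calculated structure factor amplitudes and picks the model with the lowest value. It uses the conventional crystallographic form, in which the absolute value of each difference is summed; that convention, the function names and the sample numbers are assumptions made for illustration.

```python
def r_factor(f_obs, f_calc):
    """Conventional R factor: sum of | |Fo| - |Fc| | divided by sum of |Fo|."""
    if len(f_obs) != len(f_calc):
        raise ValueError("observed and calculated lists must have equal length")
    denominator = sum(abs(fo) for fo in f_obs)
    if denominator == 0:
        raise ValueError("sum of observed amplitudes is zero")
    numerator = sum(abs(abs(fo) - abs(fc)) for fo, fc in zip(f_obs, f_calc))
    return numerator / denominator


def best_model(f_obs, models):
    """Return the name of the model (name -> calculated amplitudes) with the lowest R factor."""
    return min(models, key=lambda name: r_factor(f_obs, models[name]))
```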



Equivalent residues which are functionally analogous to a specific residue of
Bacillus amyloliquefaciens subtilisin are defined as those amino acids of the precursor
protease which may adopt a conformation such that they either alter, modify or contribute
to protein structure, substrate binding or catalysis in a manner defined and attributed to a
specific residue of the Bacillus amyloliquefaciens subtilisin. Further, they are those
residues of the precursor protease (for which a tertiary structure has been obtained by
x-ray crystallography) which occupy an analogous position to the extent that, although the
main chain atoms of the given residue may not satisfy the criteria of equivalence on the
basis of occupying a homologous position, the atomic coordinates of at least two of the
side chain atoms of the residue lie within 0.13 nm of the corresponding side chain atoms of
Bacillus amyloliquefaciens subtilisin. The coordinates of the three dimensional structure of
Bacillus amyloliquefaciens subtilisin are set forth in EPO Publication No. 0 251 446
(equivalent to US Patent 5,182,204, the disclosure of which is incorporated herein by
reference) and can be used as outlined above to determine equivalent residues on the
level of tertiary structure.

Some of the residues identified for substitution, insertion or deletion are conserved
residues whereas others are not. In the case of residues which are not conserved, the
replacement of one or more amino acids is limited to substitutions which produce a variant
which has an amino acid sequence that does not correspond to one found in nature. In
the case of conserved residues, such replacements should not result in a naturally-occurring
sequence. The protease variants of the present invention include the mature
forms of protease variants, as well as the pro- and prepro- forms of such protease
variants. The prepro- forms are the preferred construction since this facilitates the
expression, secretion and maturation of the protease variants.

"Prosequence" refers to a sequence of amino acids bound to the N-terminal
portion of the mature form of a protease which when removed results in the appearance of
the "mature" form of the protease. Many proteolytic enzymes are found in nature as
translational proenzyme products and, in the absence of post-translational processing, are
expressed in this fashion. A preferred prosequence for producing protease variants is the
putative prosequence of Bacillus amyloliquefaciens subtilisin, although other protease
prosequences may be used.

A "signal sequence" or "presequence" refers to any sequence of amino acids
bound to the N-terminal portion of a protease or to the N-terminal portion of a proprotease
which may participate in the secretion of the mature or pro forms of the protease. This
definition of signal sequence is a functional one, meant to include all those amino acid
sequences encoded by the N-terminal portion of the protease gene which participate in
the effectuation of the secretion of protease under native conditions. The present
invention utilizes such sequences to effect the secretion of the protease variants as
defined herein. One possible signal sequence comprises the first seven amino acid
residues of the signal sequence from Bacillus subtilis subtilisin fused to the remainder of
the signal sequence of the subtilisin from Bacillus lentus (ATCC 21536).

A "prepro" form of a protease variant consists of the mature form of the protease
having a prosequence operably linked to the amino terminus of the protease and a "pre"
or "signal" sequence operably linked to the amino terminus of the prosequence.

"Expression vector" refers to a DNA construct containing a DNA sequence which is
operably linked to a suitable control sequence capable of effecting the expression of said
DNA in a suitable host. Such control sequences include a promoter to effect transcription,
an optional operator sequence to control such transcription, a sequence encoding suitable
mRNA ribosome binding sites and sequences which control termination of transcription
and translation. The vector may be a plasmid, a phage particle, or simply a potential
genomic insert. Once transformed into a suitable host, the vector may replicate and
function independently of the host genome, or may, in some instances, integrate into the
genome itself. In the present specification, "plasmid" and "vector" are sometimes used
interchangeably as the plasmid is the most commonly used form of vector at present.
However, the invention is intended to include such other forms of expression vectors
which serve equivalent functions and which are, or become, known in the art.

The "host cells" used in the present invention generally are procaryotic or
eucaryotic hosts which preferably have been manipulated by the methods disclosed in US
Patent 4,760,025 (RE 34,606) to render them incapable of secreting enzymatically active
endoprotease. A preferred host cell for expressing protease is the Bacillus strain BG2036
which is deficient in enzymatically active neutral protease and alkaline protease
(subtilisin). The construction of strain BG2036 is described in detail in US Patent
5,264,366. Other host cells for expressing protease include Bacillus subtilis I168 (also
described in US Patent 4,760,025 (RE 34,606) and US Patent 5,264,366, the disclosures of
which are incorporated herein by reference), as well as any suitable Bacillus strain such
as B. licheniformis, B. lentus, etc.

Host cells are transformed or transfected with vectors constructed using
recombinant DNA techniques. Such transformed host cells are capable of either
replicating vectors encoding the protease variants or expressing the desired protease
variant. In the case of vectors which encode the pre- or prepro-form of the protease
variant, such variants, when expressed, are typically secreted from the host cell into the
host cell medium.

"Operably linked", when describing the relationship between two DNA regions,
simply means that they are functionally related to each other. For example, a
presequence is operably linked to a peptide if it functions as a signal sequence,
participating in the secretion of the mature form of the protein most probably involving
cleavage of the signal sequence. A promoter is operably linked to a coding sequence if it
controls the transcription of the sequence; a ribosome binding site is operably linked to a
coding sequence if it is positioned so as to permit translation.

The genes encoding the naturally-occurring precursor protease may be obtained in
accord with the general methods known to those skilled in the art. The methods generally
comprise synthesizing labeled probes having putative sequences encoding regions of the
protease of interest, preparing genomic libraries from organisms expressing the protease,
and screening the libraries for the gene of interest by hybridization to the probes.
Positively hybridizing clones are then mapped and sequenced.

The cloned protease is then used to transform a host cell in order to express the
protease. The protease gene is then ligated into a high copy number plasmid. This
plasmid replicates in hosts in the sense that it contains the well-known elements
necessary for plasmid replication: a promoter operably linked to the gene in question
(which may be supplied as the gene's own homologous promoter if it is recognized, i.e.,
transcribed, by the host), a transcription termination and polyadenylation region
(necessary for stability of the mRNA transcribed by the host from the protease gene in
certain eucaryotic host cells) which is exogenous or is supplied by the endogenous
terminator region of the protease gene and, desirably, a selection gene such as an
antibiotic resistance gene that enables continuous cultural maintenance of plasmid-infected
host cells by growth in antibiotic-containing media. High copy number plasmids
also contain an origin of replication for the host, thereby enabling large numbers of
plasmids to be generated in the cytoplasm without chromosomal limitations. However, it is
within the scope herein to integrate multiple copies of the protease gene into host
genome. This is facilitated by procaryotic and eucaryotic organisms which are particularly
susceptible to homologous recombination.

In one embodiment, the gene can be a natural gene such as that from B. lentus or
B. amyloliquefaciens. Alternatively, a synthetic gene encoding a naturally-occurring or
mutant precursor protease may be produced. In such an approach, the DNA and/or
amino acid sequence of the precursor protease is determined. Multiple, overlapping
synthetic single-stranded DNA fragments are thereafter synthesized, which upon
hybridization and ligation produce a synthetic DNA encoding the precursor protease. An
example of synthetic gene construction is set forth in Example 3 of US Patent 5,204,015,
the disclosure of which is incorporated herein by reference.

Once the naturally-occurring or synthetic precursor protease gene has been
cloned, a number of modifications are undertaken to enhance the use of the gene beyond
synthesis of the naturally-occurring precursor protease. Such modifications include the
production of recombinant proteases as disclosed in US Patent 4,760,025 (RE 34,606)
and EPO Publication No. 0 251 446 and the production of protease variants described
herein.

The following cassette mutagenesis method may be used to facilitate the
construction of the protease variants of the present invention, although other methods
may be used. First, the naturally-occurring gene encoding the protease is obtained and
sequenced in whole or in part. Then the sequence is scanned for a point at which it is
desired to make a mutation (deletion, insertion or substitution) of one or more amino acids
in the encoded enzyme. The sequences flanking this point are evaluated for the presence
of restriction sites for replacing a short segment of the gene with an oligonucleotide pool
which when expressed will encode various mutants. Such restriction sites are preferably
unique sites within the protease gene so as to facilitate the replacement of the gene
segment. However, any convenient restriction site which is not overly redundant in the
protease gene may be used, provided the gene fragments generated by restriction
digestion can be reassembled in proper sequence. If restriction sites are not present at
locations within a convenient distance from the selected point (from 10 to 15 nucleotides),
such sites are generated by substituting nucleotides in the gene in such a fashion that
neither the reading frame nor the amino acids encoded are changed in the final
construction. Mutation of the gene in order to change its sequence to conform to the
desired sequence is accomplished by M13 primer extension in accord with generally
known methods. The task of locating suitable flanking regions and evaluating the needed
changes to arrive at two convenient restriction site sequences is made routine by the
redundancy of the genetic code, a restriction enzyme map of the gene and the large
number of different restriction enzymes. Note that if a convenient flanking restriction site
is available, the above method need be used only in connection with the flanking region
which does not contain a site.
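
The search for unique restriction sites flanking a chosen mutation point can be sketched as follows. This Python example is a simplified illustration: it scans one strand only, measures distance from the mutation point to the nearest edge of the recognition sequence, and uses a made-up gene fragment. The function names, the toy sequence and the 15-nucleotide default are assumptions; the recognition sequences shown for EcoRI, BamHI and HindIII are the standard ones.

```python
def find_sites(gene: str, site: str) -> list[int]:
    """Return the 0-based start index of every occurrence of `site` in `gene`."""
    hits = []
    start = gene.find(site)
    while start != -1:
        hits.append(start)
        start = gene.find(site, start + 1)
    return hits


def unique_sites_near(gene: str, point: int, enzymes: dict[str, str], max_dist: int = 15):
    """Report enzymes whose site occurs exactly once in `gene` and lies within
    `max_dist` nucleotides upstream or downstream of the 0-based `point`."""
    result = {}
    for name, site in enzymes.items():
        hits = find_sites(gene, site)
        if len(hits) != 1:
            continue  # absent or not unique
        start = hits[0]
        end = start + len(site)
        if end <= point:
            side, dist = "upstream", point - end
        elif start >= point:
            side, dist = "downstream", start - point
        else:
            continue  # site overlaps the mutation point itself
        if dist <= max_dist:
            result[name] = (side, dist)
    return result


# Hypothetical fragment: EcoRI site, spacer, codon to mutate, spacer, BamHI site.
toy_gene = "GAATTC" + "ACGTACGTAC" + "GCT" + "TTGACCTTGACC" + "GGATCC" + "ACACACACAC"
enzymes = {"EcoRI": "GAATTC", "BamHI": "GGATCC", "HindIII": "AAGCTT"}
flanking = unique_sites_near(toy_gene, point=16, enzymes=enzymes)
```

Where no site is reported on one side, that is the flanking region in which a site would have to be introduced by silent nucleotide substitutions.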

Once the naturally-occurring DNA or synthetic DNA is cloned, the restriction sites
flanking the positions to be mutated are digested with the cognate restriction enzymes and
a plurality of end termini-complementary oligonucleotide cassettes are ligated into the
gene. The mutagenesis is simplified by this method because all of the oligonucleotides
can be synthesized so as to have the same restriction sites, and no synthetic linkers are
necessary to create the restriction sites.

In one aspect of the invention, the objective is to secure a variant protease having
altered allergenic potential as compared to the precursor protease, since decreasing such
potential enables safer use of the enzyme. While the instant invention is useful to lower
allergenic potential, the mutations specified herein may be utilized in combination with
mutations known in the art to result in altered thermal stability and/or altered substrate
specificity, modified activity or altered alkaline stability as compared to the precursor.






Accordingly, the present invention is directed to altering the capability of the T-cell
epitope which includes residue positions 170-173 in Bacillus lentus to induce T-cell
proliferation. One particularly preferred embodiment of the invention comprises making
one or more of the modifications R170D, Y171Q and N173D. Similarly, as discussed
in detail above, it is believed that the modification of the corresponding residues in any
protease will result in the neutralization of a key T-cell epitope in that protease. Thus, in
combination with the presently disclosed mutations in the region corresponding to amino
acid residues 170-173, substitutions at positions corresponding to
N76D/S103A/V104I/G159D optionally in combination with one or more substitutions
selected from the group consisting of positions corresponding to V68A, T213R, A232V,
Q236H, Q245R, and T260A of Bacillus amyloliquefaciens subtilisin may be used, in
addition to decreasing the allergenic potential of the variant protease of the invention, to
modulate overall stability and/or proteolytic activity of the enzyme. Similarly, the
substitutions provided herein may be combined with mutation of the asparagine (N) in
Bacillus lentus subtilisin at equivalent position +76 to aspartate (D) in combination with the
mutations S103A/V104I/G159D and optionally in combination with one or more
substitutions selected from the group consisting of positions corresponding to V68A,
T213R, A232V, Q236H, Q245R, and T260A of Bacillus amyloliquefaciens subtilisin, to
produce enhanced stability and/or enhanced activity of the resulting mutant enzyme.

The most preferred embodiments of the invention include the following specific
combinations of substituted residues corresponding to positions:

N76D/S103A/V104I/G159D/K170D/Y171Q/S173D;
V68A/N76D/S103A/V104I/G159D/K170D/Y171Q/S173D/Q236H;
V68A/N76D/S103A/V104I/G159D/K170D/Y171Q/S173D/Q236H/Q245R;
V68A/N76D/S103A/V104I/G159D/K170D/Y171Q/S173D/A232V/Q236H/Q245R; and
V68A/N76D/S103A/V104I/G159D/K170D/Y171Q/S173D/T213R/A232V/Q236H/Q245R/T260A

of Bacillus amyloliquefaciens subtilisin. These substitutions are preferably
made in Bacillus lentus (recombinant or native-type) subtilisin, although the substitutions
may be made in any Bacillus protease.
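
The slash-separated substitution notation used above (original residue, position, replacement residue) lends itself to mechanical handling. The following Python sketch parses such a string and applies it to a sequence fragment. It assumes a contiguous numbering with no gaps, which will not hold for a real subtilisin numbered by equivalence to Bacillus amyloliquefaciens subtilisin; the function names and the four-residue fragment are hypothetical.

```python
import re

_SUBSTITUTION = re.compile(r"([A-Z])(\d+)([A-Z])")


def parse_substitutions(spec: str) -> list[tuple[str, int, str]]:
    """Parse e.g. 'N76D/S103A' into [('N', 76, 'D'), ('S', 103, 'A')]."""
    substitutions = []
    for token in spec.split("/"):
        token = token.strip()
        if not token:
            continue
        match = _SUBSTITUTION.fullmatch(token)
        if match is None:
            raise ValueError(f"unrecognized substitution: {token!r}")
        original, position, replacement = match.groups()
        substitutions.append((original, int(position), replacement))
    return substitutions


def apply_substitutions(sequence: str, spec: str, first_position: int = 1) -> str:
    """Apply the substitutions to `sequence`, whose first residue carries the
    number `first_position`. Raises if the stated original residue is absent."""
    residues = list(sequence)
    for original, position, replacement in parse_substitutions(spec):
        index = position - first_position
        if not 0 <= index < len(residues):
            raise ValueError(f"position {position} is outside the sequence")
        if residues[index] != original:
            raise ValueError(
                f"expected {original} at {position}, found {residues[index]}")
        residues[index] = replacement
    return "".join(residues)


# Hypothetical fragment covering positions 170-173.
variant = apply_substitutions("KYAS", "K170D/Y171Q/S173D", first_position=170)
```

The check on the original residue is a deliberate design choice: it catches a notation written for one precursor being applied to another, for example K170 versus R170.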

Based on the screening results obtained with the variant proteases, the
mutations noted above in Bacillus amyloliquefaciens subtilisin are important to the
proteolytic activity, performance and/or stability of these enzymes and the cleaning or
wash performance of such variant enzymes.

Many of the protease variants of the invention are useful in formulating various
detergent compositions. A number of known compounds are suitable surfactants useful in
compositions comprising the protease mutants of the invention. These include nonionic,
anionic, cationic or zwitterionic detergents, as disclosed in US 4,404,128 to Barry
J. Anderson and US 4,261,868 to Jiri Flora, et al. A suitable detergent formulation is that
described in Example 7 of US Patent 5,204,015 (previously incorporated by reference).
The art is familiar with the different formulations which can be used as cleaning
compositions. In addition to typical cleaning compositions, it is readily understood that the
protease variants of the present invention may be used for any purpose that native or
wild-type proteases are used. Thus, these variants can be used, for example, in bar or liquid
soap applications, dishcare formulations, contact lens cleaning solutions or products,
peptide hydrolysis, waste treatment, textile applications, as fusion-cleavage enzymes in
protein production, etc. The variants of the present invention may comprise, in addition to
decreased allergenicity, enhanced performance in a detergent composition (as compared
to the precursor). As used herein, enhanced performance in a detergent is defined as
increasing cleaning of certain enzyme sensitive stains such as grass or blood, as
determined by usual evaluation after a standard wash cycle.

Proteases of the invention can be formulated into known powdered and liquid
detergents having pH between 6.5 and 12.0 at levels of about 0.01 to about 5% (preferably
0.1% to 0.5%) by weight. These detergent cleaning compositions can also include other
enzymes such as known proteases, amylases, cellulases, lipases or endoglycosidases, as
well as builders and stabilizers.

The addition of proteases of the invention to conventional cleaning compositions
does not create any special use limitation. In other words, any temperature and pH
suitable for the detergent is also suitable for the present compositions as long as the pH is
within the above range, and the temperature is below the described protease's denaturing
temperature. In addition, proteases of the invention can be used in a cleaning
composition without detergents, again either alone or in combination with builders and
stabilizers.

The variant proteases of the present invention can be included in animal feed such 
as part of animal feed additives as described in, for example, US 5,612,055; US 
5,314,692; and US 5,147,642. 

One aspect of the invention is a composition for the treatment of a textile that
includes variant proteases of the present invention. The composition can be used to treat
for example silk or wool as described in publications such as RD 216,034; EP 134,267;
US 4,533,359; and EP 344,259.

The following is presented by way of example and is not to be construed as a
limitation to the scope of the claims.

The variants can be screened for proteolytic activity according to methods well
known in the art. Preferred protease variants include multiple substitutions at positions
corresponding to:

N76D/S103A/V104I/G159D/K170D/Y171Q/S173D;
V68A/N76D/S103A/V104I/G159D/K170D/Y171Q/S173D/Q236H;
V68A/N76D/S103A/V104I/G159D/K170D/Y171Q/S173D/Q236H/Q245R;
V68A/N76D/S103A/V104I/G159D/K170D/Y171Q/S173D/A232V/Q236H/Q245R; and
V68A/N76D/S103A/V104I/G159D/K170D/Y171Q/S173D/T213R/A232V/Q236H/Q245R/T260A

of Bacillus amyloliquefaciens subtilisin.

All publications and patents referenced herein are hereby incorporated by
reference in their entirety.

EXAMPLES 
Example 1 

Assay for the Identification of Peptide T-Cell Epitopes
Using Naive Human T-Cells

Fresh human peripheral blood cells were collected from "naive" humans, i.e.,
persons not known to be exposed to or sensitized to Bacillus lentus protease, for
determination of antigenic epitopes in protease from Bacillus lentus and human subtilisin.
Naive humans is intended to mean that the individual is not known to have been exposed
to or developed a reaction to protease in the past. Peripheral mononuclear blood cells
(stored at room temperature, no older than 24 hours) were prepared for use as follows:
Approximately 30 ml of a solution of buffy coat preparation from one unit of whole blood
was brought to 50 ml with Dulbecco's phosphate buffered solution (DPBS) and split into
two tubes. The samples were underlaid with 12.5 ml of room temperature lymphoprep
density separation media (Nycomed density 1.077 g/ml). The tubes were centrifuged for
thirty minutes at 600G. The interface of the two phases was collected, pooled and
washed in DPBS. The cell density of the resultant solution was measured by
hemocytometer. Viability was measured by trypan blue exclusion.

From the resulting solution, a differentiated dendritic cell culture was prepared from
the peripheral blood mononuclear cell sample having a density of 10^8 cells per 75 ml
culture flask in a solution as follows:

(1) 50 ml of serum free AIM V media (Gibco) was supplemented with a
1:100 dilution beta-mercaptoethanol (Gibco). The flasks were laid flat for two
hours at 37 °C in 5% CO2 to allow adherence of monocytes to the flask wall.

(2) Differentiation of the monocyte cells to dendritic cells was as
follows: nonadherent cells were removed and the resultant adherent cells
(monocytes) combined with 30 ml of AIM V, 800 units/ml of GM-CSF (Endogen)
and 500 units/ml of IL-4 (Endogen); the resulting mixture was cultured for 5 days
at 37 °C in 5% CO2. After five days, the cytokine TNFα
(Endogen) was added to 0.2 units/ml, and the cytokine IL-1α (Endogen) was
added to a final concentration of 50 units/ml and the mixture incubated at 37 °C in
5% CO2 for two more days.

(3) On the seventh day, Mitomycin C was added to a concentration of
50 microgram/ml to stop growth of the now differentiated dendritic cell
culture. The solution was incubated for 60 minutes at 37 °C in 5% CO2. Dendritic
cells were collected by gently scraping the adherent cells off the bottom of the flask
with a cell scraper. Adherent and non-adherent cells were then centrifuged at
600G for 5 minutes, washed in DPBS and counted.

(4) The prepared dendritic cells were placed into a 96 well round
bottom array at 2x10*/well in 100 microliter total volume of AIM V media.

CD4+ T cells were prepared from frozen aliquots of the peripheral blood cell
samples used to prepare the dendritic cells using the human CD4+ Cellect Kit (Biotex) as
per the manufacturer's instructions with the following modifications: the aliquots were
thawed and washed such that approximately 10^8 cells were applied per Cellect column;
the cells were resuspended in 4 ml DPBS and 1 ml of the Cell reagent from the Cellect Kit,
and the solution was maintained at room temperature for 20 minutes. The resultant solution was
centrifuged for five minutes at 600G at room temperature and the pellet resuspended in 2
ml of DPBS and applied to the Cellect columns. The effluent from the columns was
collected in 2% human serum in DPBS. The resultant CD4+ cell solution was centrifuged,
resuspended in AIM V media and the density counted.

The CD4+ T-cell suspension was resuspended to a count of 2x10^8/ml in AIM V
media to facilitate efficient manipulation of the 96 well plate.

Peptide antigen is prepared from a 1M stock solution in DMSO by dilution in AIM V
media at a 1:10 ratio. 10 microliters of the stock solution is placed in each well of the 96
well plate containing the differentiated dendritic cells. 100 microliter of the diluted CD4+
T-cell solution as prepared above is further added to each well. Useful controls include
diluted DMSO blanks, and tetanus toxoid positive controls.

The final concentrations in each well, at 210 microliter total volume are as follows:

2x10^4 CD4+
2x10^5 dendritic cells (R:S of 10:1)
5 mM peptide

Example 2 

Identification of T-Cell Epitopes in Protease 
from Bacillus lentus and Human subtilisin 

Peptides for use in the assay described in Example 1 were prepared based on the
Bacillus lentus and human subtilisin amino acid sequence. Peptide antigens were
designed as follows. From the full length amino acid sequence of either human subtilisin
or Bacillus lentus protease provided in Figure 1, 15mers were synthetically prepared, each
15mer overlapping with the previous and the subsequent 15mer except for three residues.
Peptides used correspond to amino acid residue strings in Bacillus lentus as
provided in Figure 8, and peptides correspond to amino acid residues in human subtilisin
as provided in Figure 7. The peptides used corresponding to the proteases are provided in
Fig. 6. All tests were performed at least in duplicate. All tests reported displayed robust
positive control responses to the antigen tetanus toxoid. Responses were averaged within
each experiment, then normalized to the baseline response. A positive event was
recorded if the response was at least 3 times the baseline response.
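
The peptide design and the scoring rule in this paragraph can be sketched in Python. The example below assumes that "overlapping except for three residues" means each 15mer starts three residues after the previous one, and that the normalized response is the mean of the replicate readings divided by the mean of the baseline readings. The function names and sample readings are hypothetical, and any residues left over at the C-terminus are simply not covered in this sketch.

```python
from statistics import mean


def tile_peptides(sequence: str, length: int = 15, step: int = 3) -> list[str]:
    """Cut `sequence` into peptides of `length` residues, each offset by `step`."""
    if length <= 0 or step <= 0:
        raise ValueError("length and step must be positive")
    last_start = len(sequence) - length
    return [sequence[i:i + length] for i in range(0, last_start + 1, step)]


def normalized_response(replicates, baseline) -> float:
    """Average the replicate readings and normalize to the average baseline."""
    return mean(replicates) / mean(baseline)


def is_positive(replicates, baseline, threshold: float = 3.0) -> bool:
    """A positive event: response at least `threshold` times the baseline."""
    return normalized_response(replicates, baseline) >= threshold
```

For a 21-residue toy sequence this yields three peptides starting at residues 1, 4 and 7; consecutive peptides share twelve residues.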

The immunogenic response (i.e., T-cell proliferation) to the prepared peptides from
human subtilisin and Bacillus lentus was tallied and is provided in Figures 4 and 5,
respectively. T-cell proliferation was measured by the incorporated tritium method. The
results are shown in Figures 4 and 5 as a comparison of the immunogenic additive response
in 10 individuals (Figure 4) and 16 individuals (Figure 5) to the various peptides.
Response is indicated as the added response wherein 1.0 equals a baseline response for
each sample. Thus, in Figure 4, a reading of 10.0 or less is the baseline response and in
Figure 5 a reading of 16.0 or less is the baseline response.
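
The arithmetic behind the added response is simple: each individual contributes a normalized response of 1.0 at baseline, so the baseline for a group equals the number of individuals. A minimal Python sketch, with hypothetical function names and made-up values, is:

```python
def added_response(normalized_responses) -> float:
    """Sum the per-individual normalized responses for one peptide."""
    return sum(normalized_responses)


def exceeds_group_baseline(normalized_responses) -> bool:
    """True if the added response is above the group baseline, which equals
    the number of individuals (1.0 per individual)."""
    return added_response(normalized_responses) > len(normalized_responses)


# Ten individuals, all at baseline, give an added response of 10.0.
all_baseline = [1.0] * 10
# One responder at 4.5 times baseline raises the added response to 13.5.
one_responder = [1.0] * 9 + [4.5]
```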

As indicated in Figures 4 and 5, the immunogenic response of the naive blood
samples from unsensitized individuals showed a marked allergenic response at the
peptide fragment from Bacillus lentus corresponding to residues 170-173 of Bacillus
amyloliquefaciens protease. As expected, the corresponding fragment in human subtilisin
evokes merely baseline response.

Fig. 9 shows the T-cell response to peptides derived from Bacillus lentus protease
in a sample taken from an individual known to be hypersensitive to Bacillus lentus
protease. Peptide E05 represents the region corresponding to 170-173 in protease from
Bacillus amyloliquefaciens. As shown in Fig. 9, the hypersensitive individual was highly
responsive to the T-cell epitope represented by the peptide E05. This result confirms that,
by practicing the assay according to the invention, it is possible to predict the major
epitopes identified by the T-cells of a hypersensitive individual.

Fig. 10 shows the T-cell response to various alanine substitutions in the E05
peptide derived from Bacillus lentus protease in a sample taken from an individual known
to be hypersensitive to Bacillus lentus protease. Alanine substitutions were used
for the purpose of determining the role of any specific residue within the
epitope. The legend of Figure 10 refers to the position of the peptide in which an alanine
was substituted, i.e., in peptide E06 (sequence GSISYPARYANAMAV), G to A = 2, S to A
= 3, I to A = 4, S to A = 5, Y to A = 6, P to A = 7, R to A = 8, Y to A = 9, N to A = 10, M to
A = 11 and V to A = 12. As indicated in Figure 10, any of the substitutions
R170A, Y171A and/or N173A in protease from Bacillus lentus results in dramatically
reduced response in the hypersensitive individual's blood sample.

From these results, it is apparent that the residues 170, 171 and 173 are critical for 
T-cell response within this peptide. Accordingly, it is further apparent that these residues 
are largely responsible for the initiation of allergic reaction within the protease from 
Bacillus lentus. 



WE CLAIM:

1. A protease variant comprising a substitution made at one or more of positions in
a precursor protease corresponding to K170D, Y171Q and/or S173D of Bacillus
amyloliquefaciens subtilisin.

2. The protease variant according to claim 1, further comprising a substitution at
one or more positions in a precursor protease equivalent to those selected from the group
consisting of N76D, S103A, V104I, G159D, V68A, T213R, A232V, Q236H, Q245R, and
T260A.

3. The protease variant according to claim 2 which is derived from a Bacillus
subtilisin.

4. The protease variant according to claim 3 which is derived from Bacillus lentus
subtilisin or Bacillus amyloliquefaciens subtilisin.

5. A DNA encoding a protease variant of claim 1.

6. An expression vector encoding the DNA of claim 5.

7. A host cell transformed with the expression vector of claim 6.

8. A cleaning composition comprising the protease variant of claim 1.

9. An animal feed comprising the protease variant of claim 1.

10. A composition for treating a textile comprising the protease variant of claim 1.

11. A protease variant according to claim 1, comprising combined substitution sets
selected from the group consisting of positions corresponding to K170D/Y171Q/S173D;
N76D/S103A/V104I/G159D/K170D/Y171Q/S173D;
V68A/N76D/S103A/V104I/G159D/K170D/Y171Q/S173D/Q236H;
V68A/N76D/S103A/V104I/G159D/K170D/Y171Q/S173D/Q236H/Q245R;
V68A/N76D/S103A/V104I/G159D/K170D/Y171Q/S173D/A232V/Q236H/Q245R; and
V68A/N76D/S103A/V104I/G159D/K170D/Y171Q/S173D/T213R/A232V/Q236H/Q245R/T260A
of Bacillus amyloliquefaciens subtilisin.

12. A method for determining T-cell epitopes in humans comprising the steps of:

(a) obtaining from a single blood source a solution of dendritic cells and a solution
of naive CD4+ and/or CD8+ T-cells;

(b) promoting differentiation in said solution of dendritic cells;

(c) combining said solution of differentiated dendritic cells and said naive CD4+
and/or CD8+ T-cells with a peptide of interest;

(d) measuring the production of antibodies in said step (c).

13. A method of reducing the allergenicity of a protein comprising the steps of:

(a) identifying a T-cell epitope in said protein;

(b) modifying said protein to neutralize said T-cell epitope.

14. The method according to claim 13, wherein said epitope is modified by:

(a) substituting the amino acid sequence of the epitope with an analogous
sequence from a human homolog to the protein of interest;

(b) substituting the amino acid sequence of the epitope with an analogous
sequence from a non-human homolog to the protein of interest, which analogous
sequence produces a lesser allergenic response from T-cells than that of the protein of
interest; or

(c) substituting the amino acid sequence of the epitope with a sequence which
substantially mimics the major tertiary structure attributes of the epitope, but which
produces a lesser allergenic response from T-cells than that of the protein of interest.

15. A protein having reduced allergenicity made by the method according to claim
14.

16. A protein having reduced allergenicity, wherein said protein comprises a
modification comprising the substitution or deletion of amino acid residues which are
identified as within a T-cell epitope according to the assay provided in claim 13.